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Screening process for hydantoin racexnases 

The present invention relates to a screening process for 
the detection of improved hydantoin racemases, to novel 
hydantoin racemases themselves and to their use in the 
5 preparation of N-carbamoyl -amino acids. 

These optically active compounds are compounds that are 
frequently used in organic synthesis for the preparation 
of, for example, active ingredients having biological 
activity • They are also present in chiral auxiliaries, for 
10 example in the form of the amino alcohols (Evans reagents) . 

The enzymatic hydrolysis of 5-substituted hydantoins to N- 
carbamoyl- amino acids and the further reaction thereof to 
the correisponding encmtiomerically enriched amino acids is 
a standard method in organic chemistry ('"Enzyme Catalysis 

15 in Organic Synthesis", Eds.: Drauz, Waldmann, VCH, 1^^ and 
2^^ Ed.)- The enantiodif f erentiation can be carried out 
either at the stage of the hydantoin hydrolysis by means of 
hydantoinases or alternatively during the cleavage of the 
N- carbamoyl -ami no acids by means of enanti 6s elective 

20 carbamoyl as es . Because the enzymes each convert only one 
optical antipode of the corresponding compound, it is 
attempted to racemise the other in the mixture (in situ) in 
order to ensure the complete conversion of the hydantoin, 
which can readily be prepared racemically, into the 

25 corresponding enantiomerically enriched amino acid. The 
racemisation can proceed either at the stage of the 
hydantoins by means of chemical (base, acid, elevated 
temperature) or enzymatic processes or alternatively at the . 
stage of the N- carbamoyl -amino acids by means of, for 

30 example, acetylamino acid racemases (DE10050124) . By its 
nature, the latter variant is only successful if 
enantios elective carbamoylases are used. The following 
scheme illustrates this fact- 
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Scheme 1 : 
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For aromatic substrates, the rate of the chemical 
racemisation of the hydantoins, as shown in Table 1, is 
sufficiently high to ensure high space-time yields for the 
preparation of amino acids by the hydantoinase process • For 
aliphatic hydantoins, such as isobutyl-, methyl- and 
isopropyl -hydantoin, however, the racemisation represents a 
considerable bottleneck in the synthesis of aliphatic amino 
acids . 
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Table 1: Racemisation constants of hydantoins at 40^C, 
pH 8.5 determined by initial rates according to a first- 
order reaction (-Jcrac = ln([a]/[ao]) from: Hydrolysis and 
Formation of Hydantoins (Chpt. B 2.4), Syldatk, C. and 
5 Pietzsch, M. In: Enzyme catalysis in organic synthesis 
(Eds.: K. Drauz & H. Waldmann) , VCH, 1^*" and 2"^^ Ed.). 



5 ' -substituent 


krac (h-^) 


ti/2 (h) 


Phenyl 


2.59 


0.27 


Methyl thioethyl 


0.12 


5.82 


Isobutyl 


0.032 


21.42 


Methyl 


0.02 


33.98 


Isopropyl 


0.012 


55.90 



This problem manifests itself, for example, in the 
preparation, described in EP759475, of enantiomerically 

10 enriched tert . -butylhydantoin by means of the hydantoinase 
process. In this case, the complete conversion of 32 mM 
tert . -butylhydantoin with 1.5 kU R-hydantoinase required 
. 8 days at pH 8.5 and 4 days at pH 9.5. The low space- time 
yield is in fact caused by the only slow chemical 

15 racemisation of tert . -butylhydantoin {krac - 0.009 h"^ at 
50^C and pH 8.5) . 

There are known from the prior art hydantoin racemases from 
microorganisms of the genus Pseudomonas , Microbacterium, 
Agrobacterium and Arthrobacter (lit.: JP04271784; 

20 EP1188B26; Cloning and characterization of genes from 
Agrobacterium sp. IP 1-671 involved in hydantoin 
degradation , Hils , M . ; Muench , P . ; Al tenbuchner , J . ; 
Syldatk, C; Mattes, R. Applied Microbiology and 
Biotechnology (2001), 57(5-6), 680-688; A new razemase 

25 for 5-monosubstituted hydantoins. Pietzsch, Markus; 
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Syldatk, Christoph; Wagner, Fritz. Ann. N. Y. Acad. Sci. 

(1992), 672 (Enzyme Engineering XI) , 478-83. Lickefett, 
Holger; Krohn, Karsten; Koenig, Wilf ried A. ; Gehrcke, 
Barbel; Syldatk, Christoph. Tetrahedron: Asymmetry (1993), 
4(6), 1129-35; Purification and characterization of the 
hydantoin razemase of Pseudomonas sp. strain NS671 
expressed in Escherichia coli. Watabe, Ken; Ishikawa, 
Takahiro; Mukohara, Yukuo ; Nakamur a , Hiroaki . J. Bacterid. 

(1992), 174(24), 7989-95). 

Of the hydantoin racemases from Artiir-oibacter* SLurescens 
DSM 3745, Pseudomonas sp. NS671 and Jficrojbacteritzni 
liQuefaciens, it is known that these enzymes racemise 
aliphatic hydantoins , such as, for example, isopropyl- 
hydantoin or isobutylhydantoin, . only weakly . It is also 
known that the hydantoin racemases from Artiar-oJbacter 
aurescens DSM 3747 preferentially convert aromatic 
hydantoins, such as indolylmethylhydantoin or benzyl - 
hydantoin, whereas aliphatic hydantoins, such as methyl- 
thioethylhydantoin, are converted comparatively ' weakly or, 
in the case of isopropylhydantoin, are not convierted at all 
(A^ new razemase for 5-monosubstituted hydantoins. Pi^tzsch, 
Mairkus; Syldatk, Christoph; Wagner, Fritz. Ann.' N. Y. Acad. 
Sci; (1992), 672 (Enzyme Engineering XI), 478-83.). 

The low activi^ty of hydantoin racemases therefore 
frec[uently .limits the economic potential of this route 

In order to enable as many hydantoin racemases as possible 
tp be checked in a suitable time for their potential to 
racemise aliphatic hydantoins, the object of the present 
invention was inter alia to provide a suitable screening 
process for hydantoin racemases. Moreover, the screening 
process according to the invention should be usable as a 
component of a mutagenesis process for obtaining new and 
improved hydantoin racemases. It was also an object of the 
present invention to provide novel hydantoin racemases 
which are superior to the hydantoin racemases of the prior 



wo 2004/111227 



PCT/EP2004/005239 



5 

art at least in terms of selectivity and/or activity and/or 
stability. 

This object is achieved according to the claims. Claim 1 
relates to a screening process for hydantoin racemases . 
Dependent claims 2 to 4 indicate advantageous embodiments 
of the screening process. Claim 5 is concerned with a 
mutagenesis process for the preparation of novel hyciantoin 
racemases using the screening process according to the 
invention. Claims 6 to 11 relate to novel hydantoin 
racemases and to the nucleic acid sequences coding therefor 
and their use. Claims 12 to 14 are directed towards 
vehicles containing the hydantoin racemases according to 
the invention, or particular primers for their preparation. 

By the provision of a screening process for hydantoin 
racemases, in which . 

a) an enantios elective hydantoinase and 

b) the hydantoin racemase to be tested, which has a slower 
conversion rate compared with the hydantoinase xinder a) , 
are allowed to act on 

c) a chiral hydantoin, which is used in the opposite 
enantiomerically enriched form to the selectivity of the 
hydantoinase, and 

d) the resulting N- carbamoyl -amino acid or the freed 
protons are detected in a time-dependent manner, 

it becomes possible in a surprisingly simple and yet 
advantageous manner to check a large number of hydantoin 
racemases in a short time for their ability to racemi.se 
hydantoins in an improved manner. 

By the use of an L-enantiomer of a 5 ' -monosubstituted 
hydantoin and the use of a D-selective hydantoinase which, 
on the basis of its enantioselectivity, preferably rapidly 
hydrolyses the resulting D-eneuitiomer of the hydantoin, the 
racemisation rate and hence the activity of the hydantoin 
racemase can be measured in a simple manner by the 
formation of the N- carbamoyl -D-amino acid or by freed 
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protons. The carbamoyl -amino acid can be quantified by- 
methods knovm to the person skilled in the art, such as, 
for example, HPLC or colorimetric methods, Queintif ication 
via protons can be carried out in a simple manner via pH 
5 indicators, preferably cresol red. It should be noted that 
both D- and L-enantiomers of hydantoins having different 
optionally aliphatic 5 • -substituents can be used in the 
process . When D-hydantoins are used, corresponding L- 
selective hydantoinases are to be used in the screening 
10 process . 

In the process according to the invention there are 
advantageously used aliphatic hydantoins substituted in the 
5 '-position. In this context, aliphatically substituted 
hydantoins are understood to mean a system which has in the 
15 5 • -position on the hydantoin heterocycle a radical which is 
bonded to the heterocycle via a carbon atom having sp^- 
hybridisation. Preferred 5 • -substituents are methyl, ethyl, 
butyl , propyl , tertiary butyl , isopropyl and isobutyl . 
Ethylhydantoih is very particularly preferred. 

20 There may be used as hydantoinases any hydantoinases known 
in the literature which enantioselectively hydrolyse the 
hydantoin enantiomer formed via the hydantoin racemase, it 
being necessary for this hydrolysis to be more rapid than 
the racemisation rate. Preferred hydantoinases are the 

25 commercial hydantoinases 1 & 2 from Roche, the 

hydantoinases of the genera Agrobacterium, Arthrobacter, 
Bacillus , Pseudomonas, Flavobacterium, Pasteurella, 
Microbacterivm, Vigna, Ochrobactrum, Methanococcus , 
Burkholderia and Streptomyces. (Hils, M. ; Muench, P.; 

30 Altenbuchner, j.; Syldatk, C; Mattes, R. Cloning and 

characterization of genes from Agrobacterium sp. IP 1-671 
involved in. hydantoin degradation. Applied Microbiology and 
Biotechnology (2001), 57(5-6), 680-688. Soong, C.-L.; 
Ogawa, J.; Shimizu, S. Cyclic ureide and imide metabolism 

35 in microorganisms producing a D-hydantoinase useful for D- 
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amino acid production. Journal of Molecular Catalysis B: 
Enzymatic (2001), 12(1-6), 61-70. Wiese, Anja; Wilms, 
Burkhard; Syldatk, Chris toph; Mattes, Ralf; Altenbuchner , 
Josef. Cloning, nucleotide sequence and expression of a 
5 hydantoinase and carbamoylase gene from Arthrobacter 

aurescens DSM 3745 in Escherichia coli and comparison with 
the corresponding genes from Arthrobacter aurescens DSM 
3747: Applied Microbiology and Biotechnology (2001), 
55(6), 750-757. Yin, Bang-Ding; Chen, Yi-Chuan; Lin, Sung- 

10 Chyr; Hsu, Wen-Hwei. Production of D-amino acid precursors 
with permeabilized recombinant Escherichia coli with D- 
hydantoinase activity. Process Biochemistry (Oxford) 
(2000), 35(9), 915-921. Park, Joo-Ho; Kim, Geun-Joong; 
Lee, Seung-Goo; Lee, Dong-Cheol; Kim, Hak-Sung. . 

15 Purification and characterization of thermostable D- 

hydantoinase from Bacillus thermocatenulatus GH-2 . Applied 
Biochemistry and Biotechnology (1999), 81(1), 53-65; 
Pozo, C; Rodelas, B.; de la Escalera, S.; Gonzalez-Lopez, 
J. D, L-Hydantoinase activity of an Ochrobactriam anthropi 

20 strain- Journal of Applied Microbiology (2002), 92(6), 
1028-1034; Chung, Ji Hyung; Back, Jung Ho; Lim, Jae-Hwan; 
Park, Young In; Han, Ye Sun. Thermostable hydantoinase 
from a hyper thermophilic archaeon, Methanococcus 
jannaschii. Enzyme and Microbial Technology (2002), 

25 30(7), 867-874; Xu, Zhen; Jiang, Weihong; Jiao, Ruishen; 
Yang, Yvmliu. Cloning, sequencing and high ea^ression in 
Escherichia coli of D-hydantoinase gene from Burkholderia 
pickettii. Shengwu Gongcheng Xuebao (2002), 18(2), 
149-154; Las Heras- Vazquez, Francisco Javier; Martinez- 

30 Rodriguez, Sergio; Mingorance-Cazorla, Lydia; Clemente- 
Jimenez, Josef a Maria; Rodriguez-Vico, Felipe. 
Over express ion and characterization of hydantoin racemase 
from Agrobacterium tumefaciens C58. Biochemical and 
Biophysical Research Communications (2003), 303(2), 

35 541-547; DE 3535987; EP 1275723; US 6087136; WO 0281626; US 
2002045238; DE 4328829; WO 9400577; WO 9321336; JP 
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04325093; NL 9001680; JP 2003024074; WO 0272841; WO 
0119982; WO 9620275). 

The use of the hydantoinase from ArthrobBctGr 
crystallopoietes , especially from DSM 20117, is very 
5 particularly preferred. 

As already indicated, the conversion rate of the 
hydantoinase should be superior to that of the racemase. 
The ratio of the rate constants of the hydantoinase to the 
hydantoin racemase (khyd/krac) is preferably > 2, 
10 particularly preferably > 10 and very particularly 
preferably > 50. 

The invention also provides a process for the preparation 
of improved hydantoin racemases, which is distinguished by 
the fact that 

15 a) the nucleic acid sequence coding for the hydantoin 
racemase is subjected to a mutagenesis, 
b) the nucleic acid sequences obtainable from a) are cloned 
into a suitable vector and the vector is transferred 
into a suitable expression system, and 
20 c) the resulting hydantoin racemases having improved 

activity and/br selectivity and/or stability aire detected 
by means of a screening process according to the invention 
and isolated. 

There may be' used as Starting genes for the mutagenesis of 
25 the hydantoin racemases any known hydantoin racemase genes 
mentioned in the listed literature. Preference is given to 
the hydantoin racemase genes of Arthobacter, Pseudomoxias , 
Agrobacterium and Micrococcus (Wiese A; Pietzsch M; Syldatk ' 
C; Mattes R; Altenbuchner J Hydantoin racemase from 
30 Arthrobacter aurescens DSM 3747: heterologous expression, 
purification and characterization. JOURNAL OF 
BIOTECHNOLOGY (2000 Jul 14), 80(3), 217-30; Watabe K; 
Ishikawa T; Mukohara Y; Nakamura H Purification and 
characterization of the hydantoin racemase of Pseudomonas 
35 sp. strain NS671 expressed in Escherichia coli . JOURNAL OF 
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BACTERIOLOGY (1992 Dec), 174(24), 7989-95; Las Heras- 
Vazquez, Francisco Javier; Martinez-Rodriguez, Sergio; 
Mingorance-Cazorla, Lydia; Clemente- Jimenez, Josef a Maria; 
Rodriguez-Vico, Felipe. Overexpression and 
5 characterization of hydantoin racemase from Agrobacterium 
tumefaciens C58. Biochemical and Biophysical Research 
Communications (2003), 303(2), 541-547; EP 1188826), 
Very particular preference is given to the hydantoin 
racemase gene from Arthrojbacter aurescens , which codes for 

10 the protein sequence in Seq,ID.No. 2. 

For the mutagenesis of the hydantoin racemase there may be 
used amy methods known in the literature, such as, for 
example, random mutagenesis, saturation mutagenesis, 
cassette mutagenesis or recombination methods (May, Oliver; . 

15 Voigt, Christopher A.; Arnold, Frances H. Enzyme 

engineering by directed evolution. Enzyme Catalysis in 
Organic Synthesis (2nd Edition) (2002), 1 95-138; 
Bio/Technology 1991, 9, 1073-1077; Horwitz, M, and Loeb, 
L., Promoters Selected From Random DNA-Sequences , Proc Natl 

2.0. Acad Sci USA 83, 1986, 7405-7409; Dube, D. and L. Loeb, 
Mutants Generated By The Insertion Of Random 

Oligonucleotides Into The Active-Site Of The Beta-Lactamase 
Gene, Biochemistry 1989, 28, 5703-5707; Stemmer, P.C., 
Rapid evolution of a protein in vitro by DNA shuffling, 
25 Nature 1994, 370, 389-391 and Stemmer, P.C., DNA shuffling 
by random fragmentation and reassembly: In vitro 
recombination for molecular evolution. Proc Natl Acad Sci 
USA 91, 1994, 10747-10751). 

The cloning and expression can be carried out as in the 
30 literature mentioned hereinbelow. The process can be 

carried out several times in succession, optionally with 
varying mutagenesis strategies. 

The invention also provides rec-polypeptides or the nucleic 
acid sequences coding therefor, which are obtainable by the 
35 mutagenesis process mentioned above. 
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Another aspect of the invention is the use of the 
polypeptides so prepared in the preparation of chiral 
enantibmerically enriched N- carbamoyl -amino acids or amino 
acids. The nucleic acid sequences prepared according to the 
5 invention can be used in the preparation of whole cell, 
catalysts. 

Hydantoin racemases which have in position 79 an amino acid 
substitution with an amino acid selected from the group 
consisting of A, R, D, C, E, H, 1, L, K, M, F, P, S, 

10 T, Y and V also form part of the present invention. It is 
interesting that the amino acids surrounding this position 
are retained completely for many hydantoin racemases . The 
consensus sequence reads: FX1DX2GL (Seq. ID.No. 1), wherein 
X2 represents P or T and Xi represents W or G. Preferred 

15 mutants therefore contain the above-mentioned consensus 

sequence, Xi preferably representing an amino acid selected 
from the group consisting of A, R, N, D, C, Q, E, H, 1, L, 
K, M, F, P, S, T, Y and V. Xi corresponds to position 79. 
Preferred mutemts are shown in Table 2 . 

20 Table 2: 



Mutant name^ 


■Mutation 
(codon) 


Mutation Xi 
(amino acid) 


Activity 
change 


Seq.. ID, 
No. 


3CH11 


GQG -> GAG 


G79E 


2 


5 


1BG7. 


GGG -> AG<3 


G79R 


2 


3" 


BB5 


GGG -> TTG 


G79L. 


4 


9 


AE3 


GGG -> CAG 


G79Q 


4 


7 



Further extremely advantageous combinations of Xi and X2 
hydsuitoin racemases are listed in Table 3 below. 
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Table 3: Advantageous combinations of Xi and X2 in the 
consensus motif FX1DX2GL 



Xi 


L 


E 


Q 


R 


I. 


E 


Q 


R 


Xa 


P 


P 


P 


P 


T 


T 


T 


T 



It is particularly advantageous if the hydantoin racemases 
contain the above-mentioned consensus region and 
additionally exhibit a homology of >40% with the hydantoin 
racemase from DSM 20117. 

The invention also provides isolated nucleic acid sequences 
coding for a hydantoin racemase selected from the group: 

a) a nucleic acid sequence coding for a hydantoin racemase 
according to the invention, 

b) a nucleic acid sequence which hybridises under stringent 
conditions with the nucleic acid sequence coding for a 
hydantoin racemase according to the invention or with 
the sequence complementary thereto, 

c) a nucleic acid sequence according to Seq.ID.No. 3, 5,. 7 
or 9 or a nucleic acid sequence having a homology of 

> 80% therewith, 

d) a nucleic acid sequence containing 15 successive 
nucleotides of sequences Seq, ID^No. 3, 5, 7 or 9 . 

With regard to point d) , it is preferred for the nucleotide 
sequence according to the invention to contain 20, more 
preferably 25, yet more preferably 30, 31, 32, 33, 34 and 
most preferably more than 34 identical consecutive nucleic 
acids of the sequences Seq.ID.No. 3, 5, 7 or 9. 

As mentioned, the invention also includes nucleic acid 
sequences which hybridise under stringent conditions with 
the single-strand nucleic acid sequences according to the 
invention or with their complementary single-strand nucleic 
acid sequences (b) , or nucleic acid sequences which are 
alike in sequence sections (d) . Particular gene probes or 
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the primers necessary for a PGR, for example, are to be 
regarded as such. 

Coupling of hydantoin racemase and hydantoinase and 
optionally carbamoylase can be carried out by bringing 
5 together the free or immobilised enzymes. However, it is 
preferred for the hydantoinase to be expressed in the. same 
cell together with the hydantoin racemase and/ or the 
carbamoylase (whole cell catalyst) . . 

The nucleic acid sequences according to the invention can 
10 therefore be cloned into a whole cell catalyst as a 

constituent of a gene in a manner analogous to that in 
DE102347 64 and the literature cited therein. 
Provided that the latter then also contains genes for a 
hydantoinase and/or carbamoylase, it is capable of 
15 converting racemic hydantoins completely into 

enantiomerically enriched amino acids . Without a cloned 
carbampylase gene, the reaction stops at the stage of the 

carbamoyl -amino acids. , : 
The host organism used is preferably an organism as 
20 mentioned in DE10155928. The advanta.ge of such an organism 
is , the simultaneous expression of all the enzymes involved, 
with which only a rec-organism must be used for the total 
reaction.. 

In order to match the expression of the enzymjes in respect 
25 of their conversion rates, the corresponding coding nucleic 
acid sequences can be cloned into different plasmids with 
different copy numbers and/or promoters of different 
strengths can be used for a different strength of 
expression of the nucleic acid sequences. In such matched 
30 enzyme systems, there is advantageously no accumulation of 
an intermediate compound which may have an inhibiting 
action, and the reaction under consideration can proceed at 
an optimum overall rate. This is sufficiently well Icnown to 
the person skilled in the art, however (Gellissen, G. ; 
35 Piontek, M. ; Dahlems, U.; Jenzelewski, V.; Gavagan, J. W.; 
DiCosimo, R. ; Anton, D. L.; Janowicz, Z. A. (1996), 
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Recombinant Hansenula polymorpha as a biocatalyst- 
Coexpression of the spinach glycolate oxidase (GO) and the 
S. cerevisiae catalase T (CTTl) gene, Appl . Microbiol. 
Biotechnol . 46, 46-54; Farwick, M. ; London, M. ; Dohmen, J.; 
5 Dahlems, U. ; Gellissen, G.; Strasser, A. W. ; DE19920712). 
The preparation of such a whole cell catalyst is 
sufficiently well known to the person skilled in the art 
(Sambrook, J.; Fritsch, E. F. and Maniatis, T. (1989), 
Molecular cloning: a leiboratory manual, 2"*^ ed. , Cold 

10 Spring Harbor Laboratory Press, New York; Balbas, P. and 
Bolivar, F* (1990), Design and. construction of expression 
plasmid vectors in E, coli. Methods Enzymol . 185, 14-37; 
Rodriguez, R.L. and Denhardt, D. T (eds) (1988), Vectors: a 
survey of molecular cloning vectors eind their uses, 

15 205-225, Butterworth, Stoneham) . 

In a next embodiment, the invention relates to plasmids or 
vectors containing one or more of the nucleic acid 
sequences according to the invention. 

Suitcible plasmids or vectors are in principle any forms 

20 available to the person skilled in the art for this 

purpose. Such plasmids and vectors will be found, for 
example, in Studier et al. (Studier, W. F.; Rosenberg 
A. H.; Dunn J. J.; Dubendroff J- W. ; (1990), Use of the T7 
RNA polymerase to direct es^ression of cloned genes, 

25 Methods Enzymol. 185, 61-89) or the brochures of Novagen, 
Promega, New England Biolabs, Clontech or Gibco BRL. 
Further preferred plasmids and vectors can be found in: 
Glover, D. M. (1985) , DNA cloning: a practical approach. 
Vol. I-III, IRL Press Ltd. , Oxford; Rodriguez, R.L. and 

30 Denhardt, D. T (eds) (1988), Vectors: a survey of molecular 
cloning vectors and their uses, 179-204, Butterworth, 
Stoneham; Goeddel, D. V. (1990), Systems for heterologous 
gene expression. Methods Enzymol. 185, 3-7; Sambrook, J.; 
Fritsch, E. F. and Maniatis, T. (1989), Molecular cloning: 

35 a laboratory manual, 2"^ ed. , Cold Spring Harbor Laboratory 
Press, New York. 
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Plasmids with which the gene construct containing the 
nucleic acid according to the invention can be cloned into 
the host organism in a very preferred manner are 
derivatives of pUC18 and pUC19 (Roche Biochemicals) , 
5 PKK-177-3H (Roche Biochemicals) , pBTac2 (Roche 

Biochemicals) , pKK223-3 (Amersham Pharmacia Biotech), 
pKK-233-3 (Stratagene) or pET (Novagen) . Further preferred 
plasmids are pBRi322' (DSM3879) , pACYC184 {DSM4439) and 
pSClOl (DSM6202), which can be obtained from DSMZ-Deutsche 

10 Sammlung von Mikroorgahismen und Zellkultureh GmbH, 
Braunschweig, Germany. 
. The invention is likewise directed towards microorganisms 
containing one or more nucleic acid sequences according to 
the invention. The micrborgahism into which the plasmids 

15 containing the nucleic acid sequences according to the 
invention are cloned serves to multiply sind obtain a 

sufficient amount of the recombinant enzyme. The processes 

■ > ' ... . . ...... 

therefor are well known to the person skilled in the art 
(Sambrook, J.; Fritsch, E. F. and Maniatis, T. (1989), 

20. Molecular cloning: a laboratory manual , 2"^ ed. , Cold 

Spring Harbor Laboratory Press, New York) .. In principle, 
there can be used as microorganisms any organisms suitable 
to the person skilled in the art for this purpose, such as, 
for example, yeasts, such as H^nsGnula polymojrpha, Pichia 

25 sp., Saccharomyces cerevisiae, prokaryotes, such as E. 

coli. Bacillus subtilis, or eukaryotes, such as mammalian 
cells, insect cells. coli strains are preferably to be 
used for this purpose. Very particular preference is. given 
to: E. coji XLl Blue, W3il0, DSM14459 (PCT/USOO/08159) , MM 

30 522, JMlOl, JM109, JM105, RRl, DH5a, TOP 10" and HBlOl. 

ft 

Plasmids with which the gene construct containing the 
nucleic acid according to the invention is preferably 
cloned into the host organism are indicated above. 

A following aspect of the invention is directed towards 
35 primers for the preparation of the gene sequences according 
to the invention by means of any type of PGR. Included are 
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the sense and antisense primers coding for the 
corresponding amino acid sequences, or complementary DNA 
sequences . Suitable primers can in principle be obtained by 
processes known to the person skilled in the art. The 
5 location of the primers according to the invention is 

carried out by comparison with known DNA sequences or by 
translation of the amino acid sequences under consideration 
into the preferred codon of the organism in question (e.g. 
for Streptomyces : Wright F. and Bibb M. J. (1992), Codon 

10 usage in the G+C-rich Streptomyces genome. Gene 113, 

55-65) . Similarities in the amino acid sequence of proteins 
of so-called superf amilies are likewise of use therefor 
(Firestine, S. M. ; Nixon, A. E.; Benkovic, S. J. (199.6)., 
Threading your way to protein function, Chem. Biol. 3, 

15 779-7 83) . Further information hereon can be found in Gait, 
M. J. (1984), Oligonucleotide synthesis: a practical 
approach, IRL Press Ltd., Oxford; Innis, M. A.; Gelfound, 
D. H.; Sninsky, J. J. and White, T.J. (1990), PGR 
Protocols: A guide to methods and applications. Academic . 

20 Press Inc., San Diego. 

Preferred primers are those of Seq.ID.No. 11 and 12. 

As already indicated, the enzymes under consideration 
(hydantoin racemase, hydsuitoinases and/or carbamoyl as es ) 
can be used in free form as homogeneously purified 

25 compoxonds or as an enzyme prepared by recombinant methods 

(rec-) . The enzymes may also be used as a constituent of an 
intact guest or gsinism oir in conjunction with the cell mass 
of the host orgcinism which has been opened up and highly 
purified as desired. 

30 It is also possible to use the enzymes in immobilised form 
(Sharma B. P.; Bailey L. F. and Messing R. A. (1982), 
Immobilisierte Biomaterialiern - Techniken und Anwendungen, 
Angew. Chem. 94, 83 6-852) . Immobilisation is preferably 
carried out by lyophilisation (Paradkar, V. M. ; Dordick, J. 

35 S. (1994), Aqueous-Like Activity of a- Chymo trypsin 

Dissolved in Nearly Anhydrous Organic Solvents, J. Am. 
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Chem, Soc. 116, 5009-5010; Mori, T. ; Okahata, Y. (1997), A 
variety of lipi-coated glycoside hydrolases as effective 
glycosyl transfer catalysts in homogeneous organic 
solvents, Tetrahedron Lett. 38, 1971-1974; Otamiri, M. ; 
5 Adlercreutz, P.; Matthiasson, B. (1992), Complex formation 
between chymotrypsin and ethyl cellulose as a means to 
solbilize the enzyme in active form in toluene, 
Biocatalysis 6, 291-305) . Very special preference is given' 
to lyophilisation in the presence of surface-active 

10 substances, such as Aerosol OT or polyvinylpyrrolidone or 
polyethylene glycol (PEG) or Brij 52 ( diethyl ene glycol 
. monocetyl ether) (Kamiya, N; ; Okazaki, S.-Y.; Goto, M. 
(1997), Surfactant-horseradish peroxidase complex 
catalytically active in anhydrous benzene, Biotechnol* . 

15 Tech. 11, 375-378) : - 

Very special preference is given to immobilisation on. . 
Eupergit®, especially Eupergit C® and Eupergit 250li® (Rohm) 
(Eupergit .RTM. C, a carrier for immobilization. of enzymes 
of industrial potential . Katchalski-Katzir:, E. ; Kraemer, D, 

20 Journal of Molecular Catalysis B: Enzymatic (2000), 

10(1-3), 157-176.) 

Also preferred is immobilisation on Ni-NTA in combination 
with the polypeptide supplemented with the His tag (hexa- 
histidine) (Purification of proteins using polyhistidine 

25 affinity tags. Bornhorst, Joshua A.; Falke> Joseph J. 

Methods in Enzymology (2000), 326, 245-254) . Use as CLECs 
is also conceivable (St. Clair, N. ; Wahg, : Y.-F.; Margolin, 
A. L. (2000), Cof actor-boxind cross-linked enzyme crystals 
(CLEC) of alcohol dehydrogenase, Angew. Chem. Int. Ed. 39, 

30 380-383) . 

By means of these measures it can be possible to generate 
from polypeptides that are rendered unstable by organic 
solvents polypeptides that are stable and can work in 
mixtures of aqueous and organic solvents or in wholly 
35 orgauiic solvents. 
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Whole cell catalysts are generally used in the form of free 
or immobilised cells. For this purpose, the active cell 
mass is re- suspended in a hydantoin- containing solution. 
The cell concentration is . from 1 to 100 g/1. The 
5 concentration of hydantoin is from 0.1 to 2 molar. H20.is 
preferably used as the solvent, but mixtures of organic 
solvents and H2O can also be used. The pH value is either 
not controlled or is maintained between pH 6 and pH 10 by 
means of conventional buffers or by continuous pH 

10 monitoring. The reaction temperature is typically from 20°C 
to 90*^0. In dependence on the hydantoinase used, divalent 
metal ions are added in concentrations of from 0.1 to 5 roM. 
Preferred metal ions are Mn^*, Zn^"" or Co^*. 
With regard to the use of the individual enzymes, an 

15 equivalent procedure can be employed. 

The products prepared by the use of the hydantoin racemases 
according to the invention in the manner as deiscribed, for 
example, above are worked up by conventional methods. 
However, working up by ion exchange chromatography is 
20 preferred. As a result, the product is freed of thq salts 
formed in the reaction. The eluate is optionally clarified 
using activated carbon and the resulting enantiomerically 
enriched amino acid or N- carbamoyl -amino acid is 
precipitated by concentration of the solvent and dried. 

25 Coupling of an enzymatic racemisation with an 

enantioselective hydrolysis for the screening of hydantoin 
racemase activities has not hitherto been used to produce 
improved hydantoin racemases. For the process according to 
the invention to be applied particularly successfully, 

30 several requirements should be met: 

1. The chemical racemisation rate of the enantiomerically 
pure hydantoin used in the screeing must be very much 
lower than the rate of the enzymatically catalysed 
reaction. 
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2 . The enantioselective hydrolysis by means of the 

hydantoinase must take place very much more rapidly than 
the enzymatic racemisation of the hydantoin. 

For aliphatically substituted hydantoins/ point 1 is met 
5 owing to their slow chemical racemisation. Point 2 can be 
fulfilled by a targeted selection of suitable hydantoinases 
(see hereinabove) . 

The present invention is not rendered obvious by the 
statements madfe in the prior art, because no indications 
10 are to be found therein relating to the requirements 
mentioned hereinbefore. 

All the indicated mutants exhibit a mutation at amino acid 
position 19, which for the first time indicates the 
importance of this position for the enzyme function. It is 

15 interesting that the amino acids surrounding this position 
are retained completely for all known hydemtoin racemases. 
This shows that, for other hydantoin racemases which 
contain the above-described sequence motif and exhibit a 
high degree of homology {>40% sequence identity), improved 

20 enzyme variants can be produced by site-specific 

mutagenesis at position 79, which could not hitherto be 
derived from the prior art . 

within the scope of the invention, the expression optically 
enriched ( enantiomerically enriched) compounds is 
25 understood to mean the presence of one optical antipode in 
admixture with the other in >50 mol.%. 

The expression nucleic acid sequences includes all types of 
single-strand or double-strand DNA as well as RNA or 
mixtures thereof . 

30 According to the invention, the improvement in the activity 
and/or selectivity and/or stability means that the 
polypeptides are more active and/or more selective or less 
selective or more stable iinder the reaction conditions. 
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While the activity and the stability of the enzymes should 
naturally be as high as possible for technical application, 
the selectivity is said to be improved when either the 
substrate selectivity falls but the enantioselectivity of 
the enzymes is increased . 

According to the invention, the claimed polypeptides and 
the nucleic acid sequences also include those sequences 
which exhibit a homology (excluding natural degeneration) 
of greater than 70% (in respect of the nucleic acid 
sequence) or > 40% or 80% (in respect of the polypeptides), 
preferably greater than 90%, 91%, 92%, 93% or 94%, more 
preferably greater than 95% or 96% and particularly 
preferably greater than 97%, 98% or 99%, with one of these 
sequences, provided that the mode of action or purpose of 
such a sequence is retained- The expression "homology" (or 
identity) as used herein can be defined by the equation 
H (%) = [1 - V/X] X 100, where H means homology, X is the 
total number of nucleobases/ amino acids in the comparison 
sequence and V is the nximber of different nucleobases /amino 
acids of the sequence under consideration relative to the 
comparison sequence. In any case, the expression nucleic 
acid sequences coding for polypeptides includes all 
sequences that appear possible according to the degeneration 
of the genetic code. 

The expression "under stringent conditions" is xmderstood 
herein as described in Sambrook et ai. (Sambrook, J, ; 
Fritsch, E. F. andManiatis, T. (1989), Molecular cloning: 
a laboratory manual, 2"^ ed, , Cold Spring Harbor Laboratory 
Press, New York) . A stringent hybridisation according to 
the present invention is preferably present when, after 
washing for one hour with 1 x SSC (150 mM sodium chloride, 
15 mM sodium citrate, pH 7.0) and 0.1 % SDS (sodium 
dodecylsulfate) at SO^^C, preferably at 55^*0, more preferably 
at 62°C and most preferably at eS^'C, and more preferably for 
1 hour with 0.2 x SSC and 0.1 % SDS at SO'^C, more preferably 
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at 55^C, yet more preferably at 62^C and most preferably at 
68**C, a positive hybridisation signal is still observed.. 

The literature references cited in this specification are 
incorporated in the disclosure by reference. 

The organism Arthrobacter aurescens DSM3747 was deposited 
with Deutsche Sammlung, fur Mikrporganismen GmbH, 
Mascheroder Weg lb, 38124 Braunschweig by Rtitgerswerke 
Aktiengesellschaf t on 28.05.86.. >. , 
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Examples 

Example 1: Production of. hydantoin racemase mutants - 
random mutagenesis 

5 0.25 ng of the vector pOM21 (plasmid map see Fig,l; 

sequence see Seq.ID.No.l3) ( PCT/USOO/ 08159 ) was used as 
template in a 100 ]il PGR reaction mix consisting of PGR 
buffer (10 mM Tris, 1.5 mM MgGl2, 50 mM KGl, pH 8.5), 
200 yM dTTP, 200 uM dGTP, 200 yM dATP, 200 yM dCTP, 

10 50 pmol. of the respective primer (see Seq.ID.No.il and 12) 
. cind 2.5 U Taq polymerase (Roche) . After 3 0 cycles, the 
amplified product was purified by means of gel extraction 
(QiaexII gel extraction kit) and subcloned into the vector 
pOM21 by means of the restriction enzymes Ndel and Pstl. 

15 The ligation product was used for the transformation of 
hydantoinase-positive strains (see Example 2) . 

Example 2: Preparation of hydsmtoinase-positive strains and. 
of a mutant library 

Chemically competent E. coli JM109 (e.g. from Promega) were 
20 transformed with 10 ng of the plasmid pDHYD (see Fig. 2; see 
Seq.ID.No. 15), which carries the D-hydantoinase gene from 
Arthr-obacter crystal lopoietes DSM20117 under the control of 
a rhamnose promoter. The complete sequence of the plasmid 
is shown in Seq.ID.No. 15, The hydantoinase-positive strain' 
25 so produced was in turn rendered chemically competent and 
transformed .for the preparation of the mutant library with 
the ligation product of the hydantoin racemase random 
mutagenesis from Example 1. The colonies of the mutant 
library were spread onto ampicillin- and chloramphenicol- 
30 containing agar plates and then subjected to a screening, 
which is described in Example 3 . 
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Example 3: Screening for hydantoin racemase mutants having 
improved enzyme prpperties 

Individual colonies of the mutant library were inoculated 
in 96-well plates which were filled with 100 \il per well of 
5 LB medi\am (5 g/1 yeast extract, 10 g/1 trypton, 10 g/1 

NaCl) supplemented with rhamnose (2 g/1) and ZnCl2 (1 mM) . . 
The plates were incubated for 20 hours at 30°C. 100 \il of 
screening stibstrate (100 mM L--ethylhydantoin, 50 mg/1 
cresol red, pH 8.5) were then added to each well and the 
10 plates were incubated for 4 hours at 20^C. Wells having 
improved hydantoin racemase mutants could be identified 
directly with the eye by means of a more intense yellow 
colouration compared with the wild type, or using a. 
spectral photometer at 580 nm. 

15 Example 4: Characterisation of hydantoin racemase mutants 
having improved enzyme properties- 

The racemase muta:nts identified in the screening were 
subsequently tested by means of HPLC analysis for their 
activity in comparison with the wild type, and the 

20 corresponding mutations were determined' by means of 

sequencing. For this purpose, plasmids were isolated from 
individuail colonies of the different clones (Qiagen Mini-- 
Prep Kit) and sequenced. The same clones were used to 
produce active biomass. An overnight culture (OD6oo=4) of 

25 the respective clones was to this end diluted 1:100 in 

100 ml of LB medium (5 g/1 yeast extract, 10 g/1 trypton, 
10 g/1 NaCl) supplemented with rhamnose (2 g/1) and ZnCl2 
(1 mM) and incubated for 18 hours at 30*^0 and 250 rpm. The 
biomass was pelletised by centrifugation (10 min, 10,000 g) 

30 and the supernatant was discarded, 2 g of active biomass 
were then re-suspended in 50 ml of the substrate solution 
{100 mM L-ethylhydantoin, pH 8.5) and incubated at 37^C. 
Samples were taken after various times, the biomass was 
separated off by centrifugation (5 min, 13,000 rpm) and the 
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supernatant was analysed by means of HPL.C for the 
concentration of the N-carbamoyl-aminobutyric acid formed. 

Example 5: Preparation of L-amino acids using improved 
hydantoin racemases 

5 A strain of E. call JM109 transformed with pOM21-BB5 and 
pOM22 Fig. 3 (see Seq. ID .No. 14) (PCT/USOO/08159 ) was 
incubated at 30°C for 18 hours, with shaking (250 rpm) , in 
LB medium which contained ampicillin (100 |Xg/l) and 
chloramphenicol (50 p.g/1) and to which 2 g/1 of rhamnose 

10 . had been added. The biomass was pelletised by 

centrifugation and re-suspended in a corresponding voliame . 
of 100 mM Dli-ethylhydantoin solution, pH 8.5, and 1 mM 
C0CI2, so that a cell concentration of 30 g/1 was obtained. 
This reaction solution was incubated for 10 hours at 37°C. 

15 The ceils were then separated off by centrifugation 

(30 min, 5000 g) and the clear supernatant was analysed by 
means of HPLC for the resulting amino acid. For working up 
the resulting amino acid, the volume of the supernatant was 
reduced to half, and methanol was added 1:2. The 

20 precipitated amino acid was then filtered off and dried.. 
The total yield of the amino acid was >60%. 

Example 6: Preparation of D-amino acids using improved 
hydantoin racemases 

A strain of coli JMl 09 transformed with pOM21-BB5 and 
25 pJAVIERlS Fig. 4 (see Seq. ID.No . 16) was incubated at 30°C 
for 18 hours, with shaking (250 rpm) , in LB medium which 
contained ampicillin (100 ^ig/l) and chloramphenicol (50 
jig/1) and to which 2 g/1 of rhamnose had been added. The 
biomass was pelletised by centrifugation and re-suspended 
30 in a corresponding volume of 100 mM DL-ethylhydantoin 
solution, pH 8.5, and 1 mM C0CI2/ so that a cell 
concentration of 30 g/1 was obtained. This reaction 
solution was incubated for 10 hours at 37**C. The cells were 
then separated off by centrifugation (30 min, 5000 g) and 
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the clear supernatant was analysed by meaLns of HPLC for the 
resulting amino acid. For working up the resulting amino 
acid, the volume of the supernatant was reduced to half, 
and methanol was added 1:2 1 The precipitated amino acid was 
then filtered off eind dried. The total yield of the amino 
acid was >60% . 
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Patent claims: 

1. Screening process for hydantoin racemases, 
characterised in that 

a) an enantios elective hydantoinase and 

b) the hydcintoin racemase to be tested, which has a 
slower conversion rate compared with the 
hydantoinase under a) , are allowed to act on 

c) a chiral hydantoin, which is used in the opposite 
enantiomerically enriched form to the selectivity 
of the hydantoinase, and 

d) the resulting N- carbamoyl -amino acid or the freed 
protons are detected in a time- dependent manner. 

2. Process according to claim 1, 
characterised in that 

an aliphatically sxibstituted hydantoin is used. 

3 . Process according to one or more of the preceding 
claims, 

characterised in that 

a hydantoinase from Arthjrojbacter- cirystallopoietes is 
used. 

4. Process according to one or more of the preceding 
claims, 

characterised in that 

the ratio of the rate constants of the hydantoinase to 
the hydantoin racemase (khyd/krac) is > 2 . 

5. Process for the preparation of improved hydantoin 
rac entases, 

characterised in that 

a) the nucleic acid sequence coding for the hydantoin 
racemase is subjected to a mutagenesis, 

b) the nucleic acid sequences obtainable from a) are 
cloned into a suitable vector and the vector is 
transferred into a suitable expression system, and 
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c) the resulting hydantoin racemases having improved 
activity and/or selectivity and/ or stability are 
detected by means of a process according to one or 
more of claims 1 to 4 and isolated. 

6. rec-Polypeptides or nucleic acid sequences coding 
therefor obtainable according to claim 5 . 

7. Use of the polypeptides according to claim 6 in the 
preparation of enantiomerically enriched N-carbamoyl- 
amino acid or amino acids . 

8. Use of the nucleic acid sequences according to claim 6 
in the preparation of whole cell catalysts . 

9. Hydantoin racemase containing in posit ion 79 an amino 
acid sTibstitution with an amino acid selected from the 
group consisting of A, R, N/ D, C, Q, E, H, I, L, K, 

M, F, P, S, T, Y and V. 

10. Hydantoin racemases containing the consensus sequence 
FX1DX2GL (Seq. 1), wherein X2 represents P or T and Xi 
represents in position 79 an aidino acid selected from 
the group A, R, N, D, C, Q, E, H, I, K, M, F, P, S, 
T, Y or V. 

11. Isolated nucleic acid sequence coding for k hydantoin 
racemase selected from the group: 

a) a nucleic acid sequence coding for a hydantoin 
racemase according to claim 9 and/or 10, 
... b) a nucleic acid sequence which hybridises under 
stringent conditions with the nucleic acid 
sequence coding for a hydantoin racemase according 
to claim 9 and/or 10 or with the secjuence 
complementary thereto, 

c) a nucleic acid sequence according to Seq. ID. No. 3, 
5, 7 or 9 or a nucleic acid sequence having a 
homology of > 80% therewith, 

d) a nucleic acid sequence containing 15 successive 
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nucleotides of the sequences Seq.ID.No, 3, 5, 7 
or 9, 

12. Whole cell catalyst containing a cloned gene for a 
hydantoin raceiriase according to claims 9 and/or 10. 

13. Plasmids, vectors or microorganisms containing a 
nucleic acid sequence according to claim 9 and/or 10. 

14. Primers for the preparation of the nucleic acid 
sequences according to claim 9 and/ or 10 by means of 
PGR. 
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Fig. 2: 
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Fig: 3 
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Fig. 4 
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SEQUENCE LISTING 
<110> Degussa AG 

<120> Screening process for hydantoin racemases 
<130> 030115 AM 



<140> 
10 <141> 



<160> 16 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 6 
<212> PRT 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence 
Consensus sequence 

25 <400> 1 

Phe Xaa Asp Xaa Gly Leu 

1 5- - 

30 <210> 2 

<211> 236 
<212> PRT 

<213> Arthrobacter crystallopoietes 
35 <400> 2 

Met Arg lie Leu Val lie Asn Pro Asn Ser Ser Ser Ala Leu Thr Glu 
1 5 10 15 

Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr He He 
40 20 25 30 

Ser Ala He Asn Pro Ser Arg Gly Pro Ala Val He Glu Gly Ser Phe 
35 40 45 

45 Asp Glu Ala Leu Ala Thr Phe His Leu He Glu Glu Val Glu Arg Ala. 
50 55 60 



Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val He Ala Cys Phe Gly Asp 
65 70 75 80 

Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 90 95 



Val Ala Glu Ala Ala He His Met Ser Ser Phe Val Ala Ala Thr Phe 
55 100 105 110 

Ser He Val Ser He Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 120 125 
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Val Axg Gin Ala Gly Ala Thr Asn Arg Leu Ala Ser lie Lys Leu Piro 
130 135 140 

Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
5 145 150 155 160 

Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
165 170 175 

10 Ser lie Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
180 185 190 



15 



25 



30 



35 



Ser Asp Glu Leu Gly Val Pro Val He Asp. Pro Val Glu Ala Ala Cys 
195 200 205 

Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
210 215 220 

Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 



225 


230 


235 


<210> 
<211> 
<212> 
<213> 


3 

711 
DNA 

Artificial sequence 




<220> 
<223> 


Description of the artificial 


sequence : 1BG7 


<220> 
<221> 
<222> 


CDS 

{1)..(711) 




<400> 3 

atg aga ate etc gtg ate aac ece aac 
Met Arg He Leu Val He Asn Pro Asn 
1 5 


agt tec age gee 
Ser Ser Ser Ala 
10 



15 



48 



40 teg gtt gcg gac gca gca caa caa gtt gtc gcg acc ggc ace ata att 96 

Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr He He 

20 25 30 

tct gee ate aac ece tec aga gga ece gee gtc att gaa ggc age ttt 144 

45 Ser Ala He Asn' Pro Ser Arg Gly Pro Ala Val He Glu Gly Ser Phe 

35 40 . 45 

gac gaa gca etg gee acg ttc eat etc att gaa gag gtg gag cge get 192 

Asp Glu Ala Leu Ala Thr Phe His Leu He Glu Glu Val Glu Arg Ala 

50 50 55 60 



gag egg gaa aac ccg ece gac gee tac gtc ate gca tgt ttc agg gat 240 
Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val He Ala Cys Phe Arg Asp 
65 70 75 80 

ccg gga ctt gac gcg gtc aag gag etg act gac agg eca gtg gta gga 288 
Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 90 95 
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gtt gcc gaa get gca ate eac atg tct tea tte gtc gcg gee ace tte 336 
Val Ala Glu Ala Ala He His Met Ser Ser Phe Val Ala Ala Thr Phe 
100 105 - 110 



tec att gtc age ate etc 
Ser He Val Ser He Leu 
115 

gta egg caa gcg ggg gcg 
Val Arg Gin Ala Gly Ala 
130 

aat ctg ggg gtg atg gcc 
Asn Leu Gly Val Met Ala 
145 150 

aeg etc aaa eaa gcc gcc 
. Thr Leu Lys Gin Ala Ala 
165 

teg ata gtg etc gga tgc 
Ser He Val Leu Gly Cys 
180 

age gac gaa etc ggc gtc 
Ser Asp Glu Leu Gly Val 
195 

cgc gtg gcc gag agt ttg 
Arg Val Ala Glu Ser Leu . 
210 

aac teg tat eaa aaa ecg 
Asn Ser Tyr Gin Lys Pro 
225 230 



ecg agg gtc agg aaa eat 
Pro Arg Val Arg Lys His 
120 

acg aat cgc etc gcc tec 
Thx Asn Arg Leu Ala Ser 
135 140 

tte cat gag gac gaa cat 
Phe His Glu Asp Glu His 
155 

aag 
Lys 



gcc 
Ala 



cet gtc ate gac ecc gtc 
Pro Val lie Asp Pro Val 
200 



ctg cae gaa ctg 3 84 

Leu His Glu Leu 

125 

ate aag etc cea 432 
He Lys Leu Pro 



gcc gca ctg gag 480 
Ala Ala Leu Glu 
160 



gag gca get tgc 624 

Glu Ala Ala Cys 

205 



gtc get ctg ggc tac cag acc age aaa gcg 672 
Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
215 220 

aca gag aag eag tac etc tag 711 
Thr Glu Lys Gin Tyr Leu 
235 



gag gcg gtc cag gag gac ggc gcc gag 528 
Glu Ala Val Gin Glu Asp Gly Ala Glu 
170 175 

ggc atg gtg ggg ttt gcg egt caa ctg 576 
Gly Met Val Gly Phe Ala Arg Gin Leu 
185 190 



<210> 4 . . 
<211> 237 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence : 1BG7 
<400> 4 

Met Arg He Leu Val He Asn Pro Asn Ser Ser Ser Ala Leu Thr Glu 
1 5 10 . 15 

Ser Val Ala Asp Ala Ala Gin Gin Val . Val Ala Thr Gly Thr He He 
20 25 30 



Ser Ala He Asn Pro Ser Arg Gly Pro Ala Val He Glu Gly Ser Phe 
35 40 45 

Asp Glu Ala Leu Ala Thr Phe His Leu He Glu Glu Val Glu Arg Ala 
50 55 60 



Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val He Ala Cys Phe Arg Asp 
65 70 75 80 
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Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 90 95 

Val Ala Glu Ala Ala lie His Met Ser Ser Phe Val Ala Ala Thr Phe 
5 100 105 110 

Ser lie Val Ser lie Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 120 125 

10 Val Arg Gin Ala Gly Ala Thr Asn Arg Leu Ala . Ser lie Lys Leu Pro 
130 135 140 



15 



Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
145 150 155 160 

Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 

. . 165 170 . 175 



Ser lie Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
20 180 185 190 

Ser Asp Glu. Leu Gly Val Pro Val lie Asp Pro Val Glu Ala Ala Cys 
195 200 205 

25 Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
210 215 220 

Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 





225 


230 


.235 






30 














<210> 


5 










<211> 


711 










<212> 


DNA 








35 


<213> 


Artificial sequence 










<220> 












<223> 


Description of the artificial 


sequence : 3CH11 






40 


<220> 












<221> 


CDS 










<222> 


(1)..(711) 










<400> 


5 








45 


atg aga ate etc gtg ate aac ccc aac 


agt tec age gee 


ett 


act 




Met Arg lie Leu Val lie Asn Pro Asn 


Ser Ser Ser Ala 


Leu 


Thr 




1 


5 


10 




15 



48 



teg gtt geg gae gea gca caa caa gtt gtc gcg ace ggc acc ata att 96 

50 Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr lie lie 

20 25 30 

tct gcc ate aac ccc tec aga gga ccc gee gtc att gaa ggc age ttt 144 

Ser Ala He Asn Pro Ser Arg Gly Pro Ala Val He Glu Gly Ser Phe 
55 35 40 45 

gae gaa gea ctg gcc acg ttc eat etc att gaa gag gtg gag cgc get 192 

Asp Glu Ala Leu Ala Thr Phe His Leu He Glu Glu Val Glu Arg Ala 

50 55 60 
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gag egg gaa aac ccg ccc gac gcc tac gtc ate gca tgt ttc gag gat 240 
Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val lie Ala Cys Phe Glu Asp 
65 70 75 80 

5 

ccg gga ctt gac gcg gte aag gag ctg act gac agg cca gtg gta gga 288 
Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg. Pro Val Val Gly 
85 90 95 

10 gtt gcc gaa get gca ate cae atg tct tea ttc gte gcg gcc acc ttc 336 
Val Ala Glu Ala Ala lie His Met Ser Ser Phe Val Ala Ala Thr Phe 
100 105 110 

tec att gtc age ate etc ccg agg gtc agg aaa cat ctg cac gaa ctg 384 
15 Ser lie Val Ser lie Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 120 125 

gta egg caa gcg ggg gcg aeg aat cge etc gcc tec ate aag etc cca 432 
Val Arg Gin Ala Gly Ala Thr Asn Arg Leu Ala Ser lie Lys Leu Pro 
20 130 135 140 

aat ctg ggg gtg atg gcc ttc cat gag gac gaa eat gcc gca ctg gag 480 

Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
145 150 155 160 

25 

aeg etc aaa caa gcc gcc aag gag gcg gtc cag gag gac ggc gcc gag 528 

Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
165 .170 175 

30 teg ata gtg etc gga tgc gcc ggc atg gtg ggg ttt gcg cgt caa ctg 576 
Ser lie Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
180 185 . 190 

age gac gaa etc ggc gtc cet gte ate gac ccc gtc gag gca get tgc 624 
35 Ser Asp Glu Leu Gly Val Pro Val lie Asp Pro Val Glu Ala Ala Cys 
195 200 205 

cge gtg gee gag agt ttg gtc get ctg ggc tac cag acc age aaa gcg 672 
Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
40 210 215 220 

aac teg tat caa aaa ccg aca gag aag cag tac etc tag 711 
Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 
225 230 235 

45 

<210> 6 
<211> 237 
<212> PRT 
50 <213> Artificial sequence 

<223> Description of the artificial sequence :3CH11 

<400> 6 

Met Arg lie Leu Val lie Asn Pro Asn Ser Ser Ser Ala Leu Thr Glu 
55 1 5 10 15 



Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr He He 
20 25 30 
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Ser Ala lie Asn Pro Ser Arg Gly Pro Ala Val lie Glu Gly Ser Phe 
35 40 45 

Asp Glu Ala Leu Ala Thr Phe His Leu lie Glu Glu Val Glu Arg Ala 
50 55 60 

Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val lie Ala Cys Phe Glu Asp 
65 70 75 80 

Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 . 90 . 95 

Val Ala Glu Ala Ala lie His Met Ser Ser Phe Val Ala Ala Thr Phe 
100 105 110 

Ser lie Val Ser He Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 120 125 

Val Arg Gin Ala Gly Ala Thr Asn Arg Leu Ala Ser He Lys Leu Pro . 
130 135 140 

Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
145 150 155 16b 

Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
165 ' 170 175 

Ser lie Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
180 185 .190. 

Ser Asp Glu Leu Gly Val Prb Val He Asp Pro Val Glu Ala Ala Cys 
195 200 205 

Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
210 215 220 

Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 
225 230 235 



<210> 7 
<2il> 711 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence :AE3 



<220> 

<221> CDS 

<222> (1)..{711) 

<400> 7 

atg aga ate etc gtg ate aac ecc aac agt tec age gee ctt act gaa 48 

Met Arg He Leu Val He Asn Pro Asn Ser Ser Ser Ala Leu Thr Glu 
15 10 15 
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teg gtt gcg gac gca gca caa caa gtt gtc gcg acc ggc acc ata att 96 
Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr lie lie 
20 25 . 30 

5 tct gcc ate aac ccc tec aga gga ccc gee gtc att gaa ggc age ttt 144 
Ser Ala lie Asn Pro Ser Arg Gly Pro Ala Val lie Glu Gly Ser Phe 
35 40 45 

gac gaa gca ctg gcc acg ttc cat etc att gaa gag gtg gag cgc get 192. 
10 Asp Glu Ala Leu Ala Thr Phe His Leu lie Glu Glu Val Glu Arg Ala 
50 55 .60 

gag egg gaa aac ccg cec gac gcc tac gtc ate gca tgt ttc cag gat 240 
Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val lie Ala Cys Phe Gin Asp 
15 65 70 75 80 

ccg gga ctt gac gcg gtc aag gag ctg act gac agg eca gtg gta gga 288 
Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 90 95 

20 . 

gtt gcc gaa get gca ate cac atg tct tea ttc gtc gcg gcc acc ttc 336 
Val Ala. Glu Ala Ala lie His Met Ser Ser Phe Val Ala Ala Thr Phe 
100 105 110 

25 tec. att gtc age ate etc ccg agg gtc agg aaa cat ctg cac gaa ctg 384 
Ser lie Val Ser He Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 120 125 

gta egg caa gcg ggg gcg acg aat cgc etc gcc tec ate aag etc eca 432 
30. Val Arg Gin Ala Gly Ala Thr Asn Arg Leu Ala Ser He Lys Leu Pro 
130 135 140 

aat ctg ggg gtg atg gcc ttc cat gag gac gaa cat gee gca ctg gag 480 
Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
35 .145 150 155 160 

acg etc aaa caa gcc gcc aag gag gcg gtc cag gag gac ggc gcc gag 528 
Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
165 170 175 



40 



teg ata gtg etc gga tgc gcc ggc atg gtg ggg ttt gcg cgt caa ctg 576 
Ser He Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
180 185 190 



45 age gac gaa etc ggc gtc cct gtc ate gac ccc gtc gag gca get tgc 624 

Ser Asp Glu Leu Gly Val Pro Val He Asp Pro Val Glu Ala Ala Cys. 
195 200 205 

cgc gtg gcc gag agt ttg gtc get ctg ggc tac cag ace age aaa gcg 672 

50 Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
210 215 220 

aac teg tat caa aaa ccg aca gag aag cag tac etc tag 711 

Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 
55 225 230 235 



<210> 8 
<211> 237 
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<212> PRT - 

<213> Artificial sequence 

<223> Description of the artificial sequence :AE3 
5 <400> 8 

Met Arg lie Leu Val lie Asn Pro Asn Ser Ser. Ser Ala Leu Thr Glu 
1 5 10 15 

Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr He He 
10 .20 25 30 . 

Ser Ala He Asn Pro Ser Arg Gly Pro Ala Val He Glu Gly Ser Phe 
35. 40. 45 

15 Asp Glu Ala Leu Ala Thr Phe His Leu He Glu Glu Val Glu Arg Ala. 
50 55 60 

Glu Arg Glu, Asn Pro Pro Asp Ala Tyr Val He Ala. Cys Phe Gin Asp 
65 70 75 80 

20 

Pro Gly Leu. Asp Ala. Val Lys Glu Leu; T:hr Asp Arg Pro Val Val Gly 
85 : . 90 95 

Val Ala Glu Ala Ala He His Met Ser Ser Phe Val Ala Ala Thr Phe 
25 100 . . 105 . 110 . 

Ser He Vai Ser He Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 .120 125 

30 Val Arg Gin Ala Gly Ala Thr Asn Arg Leu Ala Ser: lie Lys Leu Pro 
130 135 140 

Asn Leu Gly Val Met . Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 

145 . . 150 ' - 155 . 160 

35 

Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
./,165 V 170. 175 V 

Ser He Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
40 180 185 190 

Ser Asp Glu Leu Gly Val Pro Val He Asp' Pro Val Glu Ala Ala Cys 
195 200 205 

45 . Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala. 
210 215, 220 

Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 
225 230 235 

50 



<210> 9 

<211> 711 

<212> DNA 

55 <213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: BB5 



wo 2004/111227 



PCT/EP2(MM/005239 



<220> 

<221> CDS 

<222> (1) . . (711) 

<400> 9 

atg aga ate etc gtg ate aac cce aac agt tec. age gee ctt aet gaa 48 
Met Arg He Leu Val He Asn Pro Asn Ser Ser Ser Ala Leu Thr Glu 



1 5 10 



15 



teg gtt gcg gac gca gca eaa eaa gtt gte gcg aec gge aec ata att 96 
Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr He lie 
20 25 30 

tct gcc ate aae ccc tec aga gga ccc gee gtc att gaa ggc age ttt i44 
Ser Ala He Asn Pro Ser Arg Gly Pro Ala Val He Glu Gly Ser Phe 
35 40 45 

gac gaa gca etg gee acg ttc eat etc att gaa gag gtg gag cgc get 192 
Asp Glu Ala Leu Ala Thr Phe His Leu He Glu Glu Val Glu Arg Ala 
50 55 60 

gag egg gaa aac ecg ccc gac gcc tac gte ate gca tgt ttc ttg gat 240 
Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val He Ala Cys Phe Leu Asp 
65 70 75 80 

ceg gga ctt gac gcg gtc aag gag ctg act gac agg cca gtg gta gga 288 
Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 90 95 

gtt gee gaa get gca ate cac atg tct tea ttc gtc gcg gcc ace ttc 336 
Val Ala Glu Ala Ala He His Met Ser Ser Phe Val. Ala Ala Thr Phe 
100 105 110 

tec att gtc age ate etc ecg agg gtc agg aaa cat ctg cac gaa ctg 384 
Ser He Val Ser He Leu Pro Arg Val Arg Lys His Leu His Glu Leu 
115 120 125 

gta egg eaa gcg ggg gcg acg aat cgc etc gcc tec ate aag etc cca 432 
Val ;^g Gin Ala Gly Ala Thr Asn Arg Leu Ala Ser He Lys Leu Pro 
130 135 140 

aat ctg ggg gtg atg gcc ttc cat gag gac gaa cat gee gca ctg gag 480 
Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
145 150 155 160 

acg etc aaa eaa gcc gcc aag gag gcg gtc cag gag gac ggc gcc gag 528 
Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
165 170 175 

teg ata gtg etc gga tgc gee ggc atg gtg ggg ttt gcg cgt eaa ctg 576 
Ser He Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
180 185 190 

age gac gaa etc ggc gtc cet gte ate gac ccc gtc gag gca get tgc 624 
Ser Asp Glu Leu Gly Val Pro Val He Asp Pro Val Glu Ala Ala Cys 
195 200 205 
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cgc gtg gcc gag agt ttg gtc get ctg ggc tac cag acc age aaa gcg 672 

Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
210 215 , 220 

5 aac teg tat caa aaa ccg aea gag aag cag tac etc tag 711 
Asn Ser Tyr Gin Lys Pro Thr Glu Lys Gin Tyr Leu 
225 230 235 

10 <210> 10 
<211> 237 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence :BB5 

15 

<400> 10 

Met Arg lie Leu Val lie Asn Pro Asn Ser Ser Ser Ala Leu Thr Glu 
1 . 5 10 . 15 

20 Ser Val Ala Asp Ala Ala Gin Gin Val Val Ala Thr Gly Thr lie lie 

20 25 30 

Ser Ala lie Asn = Pro Ser Arg Gly Pro Ala Val lie Glu Gly Ser Phe 
35 40 45 



25 



40 



55 



Asp Glu Ala Leu r Ala Thr Phe His Leu lie Glu Glu Val Glu Arg Ala 
50 55 60 



Glu Arg Glu Asn Pro Pro Asp Ala Tyr Val lie Ala Cys Phe Leu Asp 
30 65 70 75 . 80 

Pro Gly Leu Asp Ala Val Lys Glu Leu Thr Asp Arg Pro Val Val Gly 
85 90 95 

35 Val Ala Glu Ala Ala lie His Met Ser Ser Phe Val Ala Ala Thr Phe 
100 105 110 



Ser lie Val Ser lie Leu Pro, Arg Val Arg Lys His Leu His Glu Leu 

115,.^ . . 120 . 125 ■' 

Val Arg Gin. Ala Gly Ala Thr Asn Arg Leu Ala Ser lie Lys Leu Pro 
130 135 140 



Asn Leu Gly Val Met Ala Phe His Glu Asp Glu His Ala Ala Leu Glu 
45 145 150 155 160 

Thr Leu Lys Gin Ala Ala Lys Glu Ala Val Gin Glu Asp Gly Ala Glu 
165 170 175 

50 Ser lie Val Leu Gly Cys Ala Gly Met Val Gly Phe Ala Arg Gin Leu 
180 185 190 



Ser Asp Glu Leu Gly Val Pro Val lie Asp Pro Val Glu Ala Ala Cys 
195 200 205 

Arg Val Ala Glu Ser Leu Val Ala Leu Gly Tyr Gin Thr Ser Lys Ala 
210 215 220 
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Asn Ser Tyx Gin Liys Pro Thr Glu Lys Gin Tyr Leu 
225 230 235 

- 5 <210> 11 

<211> 25 . 
<212> DNA 

<213>. Artificial sequence 
10 <220> 

<223> Description, of the artifical sequence: PrimerS 



15 



45 



50 



55 



<400> 11. 

gccgcaagga atggtgcatg cat eg 25 



<210> 12 

<211> 30 

<212> DNA 

2 0 <213> Artificial sequence 

<220> 

. <223> Description of the artificial sequence: PrimerS 

25 <400> 12 . 

ggtcaggtgg gtccaccgcg ctactgccgc 30 

<210> 13 

30 <211> 5777 

<212> DNA 

<213> Artificial sequence 
<220> 

35 <223> Description of the artificial sequence :Plasinid pOM21 



40 



.<400> 13 
aattcttaag 


aaggagatat 


acatatgaga 


atcctcgtga 


tcaaccccaa 


cagttccagc 


60 


gcccttactg 


aatcggttgc 


ggacgcagca 


caacaagttg 


tcgcgaccgg 


caeca taatt 


120 


tctgccatca 


acccctccag 


aggacccgcc 


gtcattgaag 


gcagctttga 


cgaagcactg 


180 


gccacgttcc 


atctcattga 


agaggtggag 


cgcgctgagc 


gggaaaaccc 


gcccgacgcc 


240 


tacgtcatcg 


catgtttcgg 


ggatccggga 


cttgacgcgg 


tcaaggagct 


gactgacagg 


300 


ccagtggtag 


gagttgccga 


agctgcaatc 


cacatgtctt 


cattcgtcgc 


ggcca;ccttc 


360 


tccattgtca 


gcatcctccc 


gagggtcagg 


aaacatctgc 


acgaactggt 


acggcaagcg 


420 


ggggcgacga 


atcgcctcgc 


ctccatcaag 


ctcccaaatc 


tgggggtgat 


ggccttccat 


480 


gaggacgaac 


atgccgcact 


ggagacgctc 


aaacaagccg 


ccaaggaggc 


ggtccaggag 


540 


gacggcgccg 


agtcgatagt 


gctcggatgc 


gccggcatgg 


tggggtttgc 


gcgtcaactg 


600 


agcgacgaac 


tcggcgtccc 


tgtcatcgac 


cccgtcgagg 


cagcttgccg 


cgtggccgag 


660 
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agtttggtcg 


ctctgggcta 


ccagaccagc 


aaag[cgaact: 


cgtatcaaaa 


accgacagag 720 


aagcagtacc 


tctagctgca 


gccaagcttc 


tgttttggcg 


gatgagagaa 


gattttcagc 780 


ctgatacaga 


ttaaatcaga 


acgcagaagc 


ggtctgataa 


aacagaattt 


gcctggcggc 840 


agtagcgcgg 


tggtcccacc 


tgaccccatg 


ccgaactcag 


aagtgaaacg 


ccgtagcgcc 900 


gatggtagtg 


tggggtctcc 


ccatgcgaga 


. gtagggaact 


gccaggcatc 


aaataaaacg 960 


aaaggctcag 


tcgaaagact 


gggcctttcg 


ttttatctgt 


tgtttgtcgg 


tgaacgctct 1020 


cctgagtagg 


acaaatccgc 


cgggagcgga 


tttgaacgtt 


gcgaagcaac 


ggcccggagg 1080 


gtggcgggca 


ggacgcccgc 


cataaactgc 


caggcatcaa 


attaagcaga 


aggccatcct 1140 


gacggatggc 


ctttttgcgt 


ttctacaaac 


tcttttgttt 


atttttctaa 


atacattcaa 1200 


atatgtatcc 


gctcatgaga 


caataaccct 


gataaatgct 


tcaataatat 


cgtccattcc 1260 


gacagcatcg 


ccagtcacta 


tggcgtgctg 


ctagcgctat 


atgcgttgat 


gcaatttcta 1320 


tgcgcacccg 


ttctcggagc 


actgtccgac 


cgctttggcc gccgcccagt 


cctgctcgct 1380 


tcgctacttg 


gagccactat 


cgactacgcg 


atcatggcga 


ccacacccgt 


cctgtggat'c 1440 


ctctacgccg 


gacgcatcgt 


ggccggcatc 


accggcgcca 


caggtgcggt 


tgctggcgcc 1500 


tatatcgccg 


acatcaccga 


tggggaagat 


cgggctcgcc 


acttcgggct 


catgagcgct 1560 


tgtttcggcg 


tgggtatggt 


ggcaggcccc 


gtggccgggg 


gactgttggg 


cgcc^tctcc 1620 


ttgcatgcac 


cattccttgc 


ggcggcggtg 


ctcaacggcc 


tcaacctact 


actgggctgc 1680 


ttcctaatgc 


aggagtcgca 


taagggaigag 


cgtcgaccga 


tgcccttgag 


cigccttcaac 1740 


ccagtcagct 


ccttccggtg 


ggcgcggggc 


atgactatcg 


tcgccgcact 


tatgactgtc 1800 


ttctttatca 


tgcaactcgt 


aggacaggtg 


ccggcagcgc 


tctgggtcat 


tttcggcgag 1860 


gaccgctttc 


gctggagcgc 


gacgatgatc 


ggcctgtcgc 


ttgcggtatt 


cggaatcttg 1920 


cacgccctcg 


ctcaagcctt 


cgtcactggt 


cccgccacca 


aacgtttcgg 


cgagaagcag 1980 


gccattatcg 


ccggcatggc 


ggccgacgcg 


ctgggctacg 


tcttgctggc 


gttcgcgacg 2040 


cgaggctgga 


tggccttccc 


cattatgatt 


cttctcgctt 


ccggcggcat 


cgggatgccc 2100 


gcgttgcagg 


ccatgctgtc 


caggcaggta 


gatgacgacc 


atcagggaca 


gcttcaagga 2160 


t CQC t CQcaa 


ctictitiaccacr 


cc taact tea 


atcactggac 


cgctgatcgt 


cacggcgauu z^^xj 


tatgccgcct 


cggcgagcac 


atggaacggg 


ttggcatgga 


ttgtaggcgc 


cgccctatac 2280 


cttgtctgcc 


tccccgcgtt 


gcgtcgcggt 


gcatggagcc 


gggccacctc 


gacctgaatg 2340 


gaagccggcg 


gcacctcgct 


aacggattca 


ccactccaag 


aattggagcc 


aatcaattct 2400 


tgcggagaac 


tgtgaatgcg 


caaaccaacc 


cttggcagaa 


catatccatc 


gcgtccgcca 2460 
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tctccagcag ccgcacgcgg cgcatctcgg 
tgatcgtgct cctgtcgttg aggacccggc 

5 

agaatgaatc accgatacgc gagcgaacgt 
cctgagcaac aacatgaatg gtcttcggtt 
10 agtcccctac- gtgctgctga agttgcccgc 
cgatactatg actgagagtc aacgccatga 
agtccgcacc gctgtccggt agctccttcc 

15 

cacttatgac tgtcttcttt atcatgcaac 
gtcccccggc cacggggcct gccaccatac 
20 tgttccggat ctgcatcgca ggatgctgct 
taacgaagcg ctaaccgttt ttatcaggct 
aattattacc tccacgggga gagcctgagc 

25 

ggtcacactg cttccggtag tcaataaacc 

ttaacgaccc tgccctgaac cgacgaccgg 

30 atccgcttat tatcacttat tcaggcgtag 

cttaaaaaaa ttacgccccg ccctgccact 

ttctgccgac atggaagcca tcacagacgg 
35 . 

gcaccttgtc gccttgcgta taatatttgc 

ccatattggc cacgtttaaa tcaaaactgg 

40 aaaacatatt ctcaataaac cctttaggga 

catcttgcga atatatgtgt agaaactgcc 

atgaaaacgt ttcagtttgc tcatggaaaa 

45 

tcaccagctc accgtctttc attgccatac 
aagaatgtga ataaaggccg gataaaactt 
50 ggccgtaata tccagctgaa cggtctggtt 
ctcaaaatgt tctttacgat gccattggga 
tttctccatt ttagcttcct tagctcctga 

55 

tagtgatctt atttcattat ggtgaaagtt 
ttttcgccaa aagttggccc agggcttccc 
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gcagcgttgg gtcctggcca cgggtgcgca 2520 
taggctggcg gggttgcctt actggttagc 2580 
gaagcgactg. ctgctgcaaa acgtctgcga 2640 
tccgtgtttc gtaaagtctg gaaacgcgga 2700 
aacagagagt ggaaccaacc ggtgatacca 2760 
gcggcctcat ttcttattct gagttacaac 2820 
ggtgggcgcg gggcatgact atcgtcgccg 2880 
tcgtaggaca ggtgccggca gcgcccaaca 2940 
ccacgccgaa acaagcgccc tgcaccatta 3000 
ggctaccctg tggaacacct acatctgtat 3060 
ctgggaggca gaataaatga tcatatcgtc 3120 
aaactggcct caggcatttg agaagcacac 3180 
ggtaaaccag caatagacat aagcggctat 3240 
gtcgaatttg ctttcgaatt tctgccattc 3300 
caccaggcgt ttaagggcac caataactgc 3360 
catcgcagta ctgttgtaat tcattaagca 3420 
catgatgaac ctgaatcgcc agcggcatca 3480 
ccatggtgaa aacgggggcg aagaagttgt 3540 
tgaaactcac ccagggattg gctgagacga 3600 
aataggccag gttttcaccg taacacgcca 3660 
ggaaatcgtc gtggtattca ctccagagcg 3720 
cggtgtaaca agggtgaaca ctatcccata 3780 
gaattccgga tgagcattca tcaggcgggc 3840 
gtgcttattt ttctttacgg tctttaaaaa 3900 
ataggtacat tgagcaactg actgaaatgc 3960 
tatatcaacg gtggtatatc cagtgatttt 4020 
aaatctcgat aactcaaaaa atacgcccgg 4080 
ggaacctctt acgtgccgat caacgtctca 4140 
ggtatcaaca gggacaccag gatttattta 4200 
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ttctgcgaag 


tgatcttccg 


tcacaggtat 




tgccaactta 


ctgatttagt 


gtatgatggt 


5 


ctatcagctg 


tccctcctgt 


tcagctactg 




ccggacatca 


gcgctagcgg 


agtgtatact 


10 


cagtgaagtg 


cttcatgtgg 


caggagaaaa 


tgatacagga 


tatattccgc 


ttcctcgctc 




cggcgagcgg 


aaatggctta 


cgaacggggc 


15 


cttaacaggg 


aagtgagagg 


gccgcggcaa 




acaagcatca 


cgaaatctga 


cgctcaaatc 


20 


gataccaggc 


gtttcccctg 


gcggctccct 


taccggtgtc 


attccgctgt 


tatggccgcg 




ccgggtaggc 


agttcgctcc 


aagctggact 


25 


gctgcgcctt 


atccggtaac 


tatcgtcttg 




cactggcagc 


agccactggt 


aattgattta 


30 


taaggctaaa 


ctgaaaggac 


aagttttggt 




ttcaaagagt 


tggtagctca 


gagaaccttc 




ttttcagagc 


aagagattac 


gcgcagacca 


35 


cagataaaat 


atttcaagat 


ttcagtgcaa 




agccccatac 


gatataagtt 


gtaattctca 


40 


atgcggtagt 


ttatcacagt 


taaattgcta 




caatgcgctc 


atcgtcatcc 


tcggcaccgt 




tatgccggta 


ctgccgggcc 


tcttgcggga 


45 


ctgactgggt 


t.gaaggct:ct. 


caagggcatc 




attaggaagc 


agcccagtag 


taggttgagg 




gcatgcatcg 


atcaccacaa 


ttcagcaaat 


gttgccaatg 
gactggtcgt 


gcccattttc 
aatgaac 


ctgtcagtaa 


55 


<210> 14 
<211> 7175 
<212> DNA 







<213> Artificial sequence 



PCT/EP2004/005239 

14 

ttattcggcg caaagtgcgt cgggtgatgc 4260 
gtttttgagg tgctccagtg gcttctgttt 4320 
acggggtggt gcgtaacggc aaaagcaccg 43 80 
ggcttactat gttggcactg atgagggtgt 4440 
aaggctgcac cggtgcgtca gcagaatatg 4500 
actgactcgc tacgctcggt cgttcgactg 4560 
ggagatttcc tggaagatgc caggaagata 4620 
agccgttttt ccataggctc cgcccccctg 4680 
agtggtggcg aaacccgaca ggactataaa 4740 
cgtgcgctct cctgttcctg cctttcggtt 4800 
tttgtctcat tccacgcctg acactcagtt . 4860 
gtatgcacga accccccgtt cagtccgacc 4920 
agtccaaccc ggaaagacat gcaaaagcac 49 80 
gaggagttag tcttgaagtc atgcgccggt 5040 
gactgcgctc ctccaagcca gttacctcgg 5100.. 
gaaaaaccgc cctgcaaggc ggttttttcg 5160 
aaacgatctc aagaagatca tcttattaat 5220 
tttatctctt caaatgtagc acctgaagtc 5280 
tgtttgacag cttatcatcg ataagcttta 5340 
acgcagtcag gcaccgtgta tgaaatctaa 5400 
caccctggat gctgtaggca taggcttggt 5460 
ttagtcatgc cccgcgccca ccggaaggag 5520 
ggtcgacgct ctcccttatg cgactcctgc 5580 
ccgttgagca ccgccgccgc aaggaatggt 5640 
tgtgaacatc atcacgttca tctttccctg 5700 
cgagaaggtc gcgaattcag gcgcttttta 5760 

5777 
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<220> 

<223> Description of the artificial sequence: Plasmid pOM22 
<400> 14 

aattcttaag aaggagatat acatatgacc ctgcagaaag cgcaagcgna gcgcattgag 60 
aaagagatct gggagctctc ccggttctcg gcggaaggcc ccggtgttac ccggctgacc 120 
tacactccag agcatgccgc cgcgcgggaa acgctcattg cggctatgga agcggccgct 180 
ttgagcgttc gtgaagacgc tctcgggaac atcatcggcc gacgtgaagg cactgatccg 240 
cagctccctg cgatcgcggt cggttcacac ttcgattctg tccgaaacgg cgggatgttc 300 
gatggcactg caggcgtggt gtgcgccctt gaggctgccc gggtgatgct ggagagcggc 360 
. tacgtgaatc ggcatccatt tgagttcatc gcgatcgtgg aggaggaagg ggcccgcttc 420 
agcagtggca tgttgggcgg ccgggccatt gcaggtttgg tcgccgacag ggaactggac 480 
tctttggttg atgaggatgg agtgtccgtt aggcaggcgg ctactgcctt cggcttgaag 540 
ccgggcgaac tgcaggctgc agcccgctcc gcggcggacc tgcgtgcttt tatcgaacta 600 
cacattgaac aaggaccgat cctcgagcag gagcaaatag agatcggagt tgtgacctcc 660 
atcgttggcg ttcgcgcatt gcgggttgct gtcaaaggca gaagcgcaca cgccggcaca 720 
acccccatgc acctgcgcca ggatgcgctg gtacccgccg ctctcatggt gcgggaggtc 780 
aaccggttcg tcaacgagat cgccgatggc acagtggcta ccgttggcca cctcacagtg 840 
gcccccggtg gcggcaacca ggtcccgggg gaggtggagt tcacactgga cctgcgttct 900 
ccgcatgagg agtcgctccg ggtgttgatc aaccgcatct cggtcatggt cggcgaggtc 960 
gcctcgcagg ccggtgtggc tgccgatgtg gatgaatttt tcaatctcag cccggtgcag 1020 
ctggctccta ccatggtgga cgccgttcgc gaagcggcct cggccctgca gttcacgcac 1080 
cgggatatca gcagtggggc gggccacgac tcgatgttca tcgcccaggt cacggacgtc 1140 
ggaatggttt tcgttccaag ccgtgctggc cggagccacg ttcccgaaga atggaccgat 1200 
ttcgatgacc ttcgcaaggg aactgaggtt gtcctccggg taatgaaggc acttgaccgg 1260 
ggatcccatc atcatcatca tcattgactg cagccaagct tctgttttgg cggatgagag 1320 
aagattttca gcctgataca gattaaatca gaacgcagaa gcggtctgat aaaacagaat 1380 
ttgcctggcg gcagtagcgc ggtggtccca cctgacccca tgccgaactc agaagtgaaa 1440 
cgccgtagcg ccgatggtag tgtggggtct ccccatgcga gagtagggaa ctgccaggca 1500 
tcaaataaaa cgaaaggctc agtcgaaaga ctgggccttt cgttttatct gttgtttgtc 1560 
ggtgaacgct ctcctgagta ggacaaatcc gccgggagcg gatttgaacg ttgcgaagca 1620 
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acggcccgga gggtggcggg caggacgccc 
gaaggccatc ctgacggatg gcctttttgc 
5 aaatacattc aaatatgtat ccgctcatga 
attgaaaaag gaagagtatg agtattcaac 
cggcattttg ccttcctgtt tttgctcacc 

10 

aagatcagtt gggtgcacga gtgggttaca 
ttgagagttt tcgccccgaa gaacgttttc 
15 gtggcgcggt attatcccgt gttgacgccg 
attctcagaa tgacttggtt gagtactcac 
tgacagtaag agaattatgc agtgctgcca 

20 

tacttctgac aacgatcgga ggaccgaagg 

atcatgtaac tcgccttgat cgttgggaac 

25 agcgtgacac cacgatgcct gtagcaatgg 

aactacttac tctagcttcc cggcaacaat 

caggaccact tctgcgctcg gcccttccgg 
30 • ^ ; ' ' ' 

ccggtgagcg tgggtctcgc ggtatcattg 

gtatcgtagt tatctacacg acggggagtc 

35 tcgctgagat aggtgcctca ctgattaagc 

atatacttta gattgattta aaacttcatt 

tttttgataa tctcatgacc aaaatccctt 

40 ^ - 

accccgtaga aaagatcaaa ggatcttctt 
gcttgcaaac aaaaaaacca ccgctaccag 
45 caactctttt tccgaaggta actggcttca 
tagtgtagcc gtagttaggc caccacttca 
ctctgctaat cctgttacca gtggctgctg 

50 

tggactcaag acgatagtta ccggataagg 
gcacacagcc cagcttggag cgaacgacct 
55 tatgagaaag cgccacgctt cccgaaggga 
gggtcggaac aggagagcgc acgagggagc 
gtcctgtcgg gtttcgccac ctctgacttg 



16 



gccatiaaacC 


gccaggcat^c 


aaa 1 1 aagca 


1680 


g^ttctacaa 


actcttttgt 


ttatttttct 


1740 


gacaataacc 


ctgataaatg 


cttcaataat 


1800 


atttccgtgt 


cgcccttatt 


cccttttttg 


1860 


cagaaacgct: 


ggt:aaaaat.a 


aaagatgct:g 


1920 


tcgaactgga 


tctcaacagc 


ggtiaagat^cc 


1980 


caat.gat:gag 


cacttttaaa 


gttctgctat 


2040 




actcQcitccjc 


cgcatacact 


2100 


cagt^cacaga 


aaagcat^ctz^^ 


accrcra CcTGfca 


2160 


taaccatiaaa 


tcrataacact 


Qcacrccaac^ 

^jl Xp* ^^^A 


2220 




ttttttacac 


aacaticfcrcrcfci 


2280 


c aaaac t craa 


t: craacf c ca ti a 


c caaaccraccr 


2340 




crccrcaaactia 


t fcaa c t crcf ccr 


2400 


taatacractcr 


cr a ti cr cracrcf c CI 


era taaacit: tcr 


2460 


c t CTcr c t crcr 1 1 

S» 3j S ^ «S ** 


tattcrctcrati 


aaa ti c ti crcf acr 


2520 


c!a.cf ca.c t cro'cr 


accacratcjCTt 


aagcccCccc 


2580 


acrcrcaac ta^ 


ocra ticraaccra 


aatiagacaga 


2640 


attggtaact 


gtcagaccaa 


gtt tact cat 


2700 


tttaatttaa 


aaggatctag 


gtgaagatcc 


2760 


aacg^gag t ti 


tit:cgCtccac 


tcracicQtcacr 


2820 


gagatcctt^ 


tt^tctgcgc 


gtaatctgct 


2880 


c era b crcr t ^ t o 


t^tocccTGrat 

w w 23 ^9 ^ 


caagagc tac 


2940 


CTcaaaococa 


gataccaaat: 


actgtccttc 


3000 


agaactctgt 


agcaccgcct. 


acatacctcg 


3060 


ccacrt-crcrccTa 


t.aagCcgt.g^ 


cttaccgggt 


3120 


cgcagcggtc 


gggctgaacg 


gggggttcgt 


3180 


acaccgaact 


gagataccta 


cagcgtgagc 


3240 


gaaaggcgga 


caggtatccg 


gtaagcggca 


3300 


ttccaggggg 


aaacgcctgg 


tatctttata 


3360 


agcgtcgatt 


tttgtgatgc 


tcgtcagggg 


3420 
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ggcggagcct atggaaaaac gccagcaacg 
^ ggccttttgc tcacatgttc tttcctgcgt 
ccgcctttga gtgagctgat accgctcgcc 
tgagcgagga agcggaagag cgcctgatgc 
10 tttcacaccg catatatggt gcactctcag 
ccagtataca ctccgctatc gctacgtgac 
acacccgctg acgcgccctg acgggcttgt 

15 

gtgaccgtct ccgggagctg catgtgtcag 
aggcagctgc ggtaaagctc atcagcgtgg 
20 tcatccgcgt ccagctcgtt gagtttctcc 
cgggccatgt taagggcggt tttttcctgt 
aatttctgtt catgggggta atgataccga 

25 

ttactgatga tgaacatgcc cggttactgg 
ggatgcggcg ggaccagaga aaaatcactc 
30 atgtaggtgt tccacagggt agccagcagc 
tgcagggcgc tgacttccgc gtttccagac 
atgttgttgc tcaggtcgca gacgttttgc 

35 

tcggtgattc attctgctaa ccagtaaggc 
acaggagcac gatcatgcgc acccgtggcc 
40 tgcggctgct ggagatggcg gacgcgatgg 
tcacagttct ccgcaagaat tgattggctc 
ggtgccgccg gcttccattc aggtcgaggt 

45 

ggggaggcag acaaggtata gggcggcgcc 
cgccgaggcg gcataaatcg ccgtgacgat 
50 aagagccgcg agcgatcctt gaagctgtcc 
catggcctgc aacgcgggca tcccgatgcc 
ggccatccag cctcgcgtcg cgaacgccag 

55 

gccggcgata atggcctgct tctcgccgaa 
ttgagcgagg gcgtgcaaga ttccgaatac 
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cggccttttt acggttcctg gccttttgct 3480 
tatcccctga ttctgtggat aaccgtatta 3540 
gcagccgaac gaccgagcgc agcgagtcag 3600 
ggtattttct ccttacgcat ctgtgcggta 3 660 
tacaatctgc tctgatgccg catagttaag 3720 
tgggtcatgg ctgcgccccg acacccgcca 3780 
ctgctcccgg catccgctta cagacaagct 3840 
aggttttcac cgtcatcacc gaaacgcgcg 3900 
tcgtgaagcg attcacagat gtctgcctgt 3960 
agaagcgtta atgtctggct tctgataaag 4020 
ttggtcactt gatgcctccg tgtaaggggg 4080 
tgaaacgaga gaggatgctc acgatacggg 4140 
aacgttgtga gggtaaacaa ctggcggtat 4200 
agggtcaatg ccagcgcttc gttaatacag 4260 
atcctgcgat gcagatccgg aacataatgg 4320 
ttbacgaaac acggaaaccg aagaccattc 4380 
agcagcagtc gcttcacgtt cgctcgcgta 4440 
aaccccgcca gcctagccgg gtcctcaacg 4500 
aggacccaac gctgcccgag atgcgccgcg 4560 
atatgttctg ccaagggttg gtttgcgcat 4620 
caattcttgg agtggtgaat ccgttagcga 4680 
ggcccggctc catgcaccgc gacgcaacgc 4740 
tacaatccat gccaacccgt tccatgtgct 4800 
cagcggtcca gtgatcgaag ttaggctggt 4860 
ctgatggtcg tcatctacct gcctggacag 4920 
gccggaagcg agaagaatca taatggggaa 4980 
caagacgtag cccagcgcgt cggccgccat 5040 
acgtttggtg gcgggaccag tgacgaaggc 5100 
cgcaagcgac aggccgatca tcgtcgcgct 5160 
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ccagcgaaag cggtcctcgc cgaaaatgac ccagagcgct gccggcacct gtcctacgag 5220 
ttgcatgata aagaagacag tcataagtgc ggcgacgata gtcatgcccc gcgcccaccg 5280 
gaaggagctg actgggttga aggctctcaa gggcatcggt cgacgctctc ccttatgcga 5340 
ctcctgcatt aggaagcagc ccagtagtag gttgaggccg ttgagcaccg ccgccgcaag 5400 
gaatggtgca tgcatcgatc accacaattc agcaaattgt gaacatcatc acgttcatct 5460 
ttccctggtt gccaatggcc. cattttcctg tcagtaacga gaaggtcgcg aattcaggcg 5520 
ctttttagac tggtcgtaat gaacaattct taagaaggag atatacatat gtttgacgta 5580 
atagttaaga actgccgtat ggtgtccagc gacggaatca ccgaggcaga cattctggtg 5640 
aaagacggca aagtcgccgc aatcagctcg gacacaagtg atgttgaggc gagccgaacc 5700 
attgacgcgg gtggcaagtt cgtgatgccg ggcgtggtcg atgaacatgt gcatatcatc 5760 
gacatggatc tgaagaaccg gtatggccgc ttcgaactcg attccgagtc tgcggccgtg 5820 
ggaggcatca ccaccatctt tgagatgccg tttaccttcc cgcccaccac cactttggac 5880 
gccttcctcg aaaagaagaa gcaggcgggg cagcggttga aagttgactt cgcgctctat 5940 
ggcggtggag tgccgggaaa cctgcccgag atccgcaaaa tgcacgacgc cggcgcagtg 6000 
ggcttcaagt caatgatggc agcctcagtt ccgggcatgt tcgacgccgt cagcgacggc 6060 
gaactgttcg aaatcttcca ggagatcgca gcctgtggtt cagtcgccgt ggtccatgcc 6120 
gagaatgaaa cgatcattca agcgctccag aagcagatca aagccgctgg tcgcaaggac 6180 
atggccgcct acgaggcatc ccaaccagtt ttccaggaga acgaggccat tcagcgtgcg 6240 
ttactactgc agaaagaagc cggctgtcga ctgattgtgc ttcacgtgag caaccctgac 6300 
ggggtcgagc tgatacatcg ggcgcaatcc gagggccagg acgtccactg cgagtcgggt 6360 
ccgcagtatc tgaatatcac cacggacgac gccgaacgaa tcggaccgta tatgaaggtc 6420 
gcgccgcccg tccgctcagc cgagatgaac gtcagattat gggaacaact tgagaacggg 6480 
ctcatcgaca cccttgggtc agaccacggc ggacatcctg tcgaggacaa agaacccggc 6540 
tggaaggacg tgtggaaagc cggcaacggt gcgctgggcc ttgagacatc cctgcctatg 6600 
atgctgacca acggagtgaa taaaggcagg ctatccttgg aacgcctcgt cgaggtgatg 6660 
tgcgagaaac ctgcgaagct ctttggcatc tatccgcaga agggcacgct acaggttggt 6720 
tccgacgccg atctgctcat cctcgatctg gatattgaca ccaaagtgga tgcctcgcag 6780 
ttccgatccc tgcataagta cagcccgttc gacgggatgc ccgtcacggg tgcaccggtt 6840 
ctgacgatgg tgcgcggaac ggtggtggca gagaagggag aagttctggt cgagcaggga 6900 
ttcggccagt tcgtcacccg tcacgactac gaggcgtcga agtgaggatc tcgacgctct 6960 
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cccttatgcg actcctgcat taggaagcag cccagtagta ggttgaggcc gttgagcacc 7020 

gccgccgcaa ggaatggtgc atgcatcgat caccacaatt cagcaaattg tgaacatcat 7080. 

5 . • 

cacgttcatc tttccctggt tgccaatggc ccattttcct gtcagtaacg agaaggtcgc 7140 
gaattcaggc gctttttaga ctggtcgtaa tgaac 7175 

10 

<210> 15 
<211> 5989 
<212> DNA 

<213> Artificial sequence 
15 . 
<220> 

<223> Description of the artificial sequence : Plasmid pDHYH 
<400> 15 

20 aattcttaag aaggagatat acatatggat gcaaagctac tggttggcgg cactattgtt 60 
tcctcgaccg gcaaaatccg agccgacgtg ctgattgaaa acggcaaagt cgccgctgtc 120 
ggcatgctgg acgccgcgac gccggacaca gttgagcggg ttgactgcga cggcaaatac 180 

25 

gtcatgcccg gcggtatcga cgttcacacc cacatcgact cccccctcat ggggaccacc 240 
accgccgatg attttgtcag cggaacgatt gcagccgcta ccggcggaac aacgaccatc 300 
30 gtcgatttcg gacagcagct cgccggcaag aacctgctgg aatccgcaga cgcgcaccac 360 
aaaaaggcgc aggggaaatc cgtcattgat tacggcttcc atatgtgcgt gacgaacctc 42 0 
tatgacaatt tcgattccca tatggcagaa ctgacacagg acggaatctc cagtttcaag 480 

35 

gtcttcatgg cctaccgcgg aagcctgatg atcaacgacg gcgaactgtt cgacatcctc 540 
aagggagtcg gctccagcgg tgccaaacta tgcgtccacg cagagaacgg cgacgtcatc 600 
40 gacaggatcg ccgccgacct ctacgcccaa ggaaaaaccg ggcccgggac ccacgagatc 660 
gcacgcccgc cggaatcgga agtcgaagca gtcagccggg ccatcaagat ctcccggatg 720 
gccgaggtgc cgctgtattt cgtgcatctt tccacccagg gggccgtcga ggaagtagct 7 80 

45 

gccgcgcaga tgacaggatg gccaatcagc gccgaaacgt gcacccacta cctgtcgctg 840 
agccgggaca tctacgacca gccgggattc gagccggcca aagctgtcct cacaccaccg 900 
50 ctgcgcacac aggaacacca ggacgcgttg tggagaggca ttaacaccgg tgcgctcagc 960 
gtcgtcagtt ccgaccactg ccccttctgc tttgaggaaa agcagcggat gggggcagat 1020 
gacttccggc agatccccaa cggcgggccc ggcgtggagc accgaatgct cgtgatgtat 1080 

55 

gagaccggtg tcgcggaagg aaaaatgacg atcgagaaat tcgtcgaggt gactgccgag 1140 
aacccggcca agcaattcga tatgtacccg aaaaagggaa caattgcacc gggctccgat 1200 
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gcagacatca tcgtggtcga ccccaacgga acaaccctca tcagtgccga cacccaaaaa 1260 
caaaacatgg actacacgct gttcgaaggc ttcaaaatcc gttgctccat cgaccaggtg 1320 
ttctcgcgtg gcgacctgat cagcgtcaaa ggcgaatatg tcggcacccg cggccgcggc 13 80 
gaattcatca agcggagcgc ttggagccac ccgcagttcg aaaaataaaa gcttggctgt 1440 
tttggcggat gagagaagat tttcagcctg atacagatta aatcagaacg cagaagcggt 1500 
ctgataaaac agaatttgcc . tggcggcagt agcgcggtgg tcccacctga ccccatgccg .1560 
aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg ggtctcccca tgcgagagta 1620 
gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcgttt 1680 
tatctgttgt ttgtcggtga acgctctcdt gagtaggaca aatccgccgg ^^gcggattt 1740 
gaacgttgcg aagcaacggc ccggagggtg gcgggcagga cgcccgccat aaactgccag 1800 
gcatcaaatt aagcagaagg ccatcctgac ggatggcctt tttgcgtttc tacaaactct 1860 
tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat 1920 
aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc 1980 
ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga 2040 
aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca 2100 
acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt 2160 
ttaaagttct gctatgtggc . gcggtattat cccgtgttga cgccgggcaa gagcaac teg 2220 
gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc 2280 
atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata 2340 
acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt 2400 
tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag 2460 
csracgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca 2520 
aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg 2580 
aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg 2640 
ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag 2700 
atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg 2760 
aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag 2820 
accaagttta ctcatatata ctttagattg atittaaaact tcatttttaa tttaaaagga 2880 
tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt 2940 
tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc 3000 
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tgcgcgtaat ctgctgcttg 



cggatcaaga gctaccaact 

5 • 

caaatactgt ccttctagtg 



cgcctacata cctcgctctg 
10 cgtgtcttac. cgggttggac 
gaacgggggg ttcgtgcaca 



acctacagcg tgagctatga 

15 

atccggtaag cggcagggtc 



cctggtatct ttatagtcct 
20 gatgctcgtc aggggggcgg 



tcctggcctt ttgctggcct 



tggataaccg tattaccgcc 

25 

agcgcagcga gtcagtgagc 



cgcatctgtg cggtatttca 
30 tgccgcatag ttaagccagt 



ccccgacacc cgccaacacc 



gcttacagac aagctgtgac 

35 

tcaccgaaac gcgcgaggca 



cagatgtctg cctgttcatc 



40 tggcttctga taaagcgggc 



ctccgtgtaa gggggaattt 



tgctcacgat acgggttact 

45 

aacaactggc ggtatggatg 



gcttcgttaa tacagatgta 



50 tccggaacat aatggtgcag 



aaccgaagac cattcatgtt 



acgttcgctc gcgtatcggt 

55 

gccgggtcct caacgacagg 



ccgagatgcg ccgcgtgcgg 





21 


caaacaaaaa 


aaccaccgct 


ctttttccga 


aggtaactgg 


tagccgtagt 


taggccacca 


ctaatcctgt 


taccagtggc 


tcaagacgat 


agttaccgga 


cagcccagct 


tggagcgaac 


gaaagcgcca 


cgcttcccga 


ggaacaggag 


agcgcacgag 


gtcgggtttc 


gccacctctg 


agcctatgga 


aaaacgccag 


tttgctcaca 


tgttctttcc 


tttgagtgag 


ctgataccgc 


gaggaagcgg 


aagagcgcct 


caccgcatat 


atggtgcact 


atacactccg 


ctatcgctac 


cgctgacgcg 


ccctgacggg 


cgtctccggg 


agctgcatgt 


gctgcggtaa 


agctcatcag 


cgcgtccagc 


tcgttgagtt 


catgttaagg 


gcggtttttt 


ctgttcatgg 


gggtaatgat 


gatgatgaac 


atgcccggtt 


cggcgggacc 


agagaaaaat 


ggtgttccac 


agggtagcca 


ggcgctgact 


tccgcgtttc 


gttgctcagg 


tcgcagacgt 


gattcattct 


gctaaccagt 


agcacgatca 


tgcgcacccg 


ctgctggaga 


tggcggacgc 



accagcggtg gtttgtttgc 3060 
cttcagcaga gcgcagatac 3120 
cttcaagaac tctgtagcac 3180 
tgctgccagt ggcgataagt 3240 
taaggcgcag cggtcgggct 3300 
gacctacacc gaactgagat 3360 
agggagaaag gcggacaggt 3420 
ggagcttcca gggggaaacg 3480 
acttgagcgt cgatttttgt 3 540 
caacgcggcc tttttacggt 3600 
tgcgttatcc cctgattctg 3660 
tcgccgcagc cgaacgaccg 3720 
gatgcggtat tttctcctta 3780 
ctcagtacaa tctgctctga 3840 
gtgactgggt catggctgcg 3900 
cttgtctgct cccggcatcc 3960 
gtcagaggtt ttcaccgtca 4020 
cgtggtcgtg aagcgattca 4080 
tctccagaag cgttaatgtc 4140 
cctgtttggt cacttgatgc 4200 
accgatgaaa cgagagagga 4260 
actggaacgt tgtgagggta 4320 
cactcagggt caatgccagc 4380 
gcagcatcct gcgatgcaga 4440 
cagactttac gaaacacgga 4500 
tttgcagcag cagtcgcttc 4560 
aaggcaaccc cgccagcctia 4620 
tggccaggac ccaacgctgc 4680 
gatggatatg ttctgccaag 4740 
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ggttggtttg 


cgcattcaca 


gttctccgca 


agaattgatt 


ggctccaatt 


cttggagtgg 


4800 


tgaatccgtt 


agcgaggtgc 


cgccggcttc 


cattcaggtc 


gaggtggccc 


ggctccatgc 


4860 


accgcgacgc 


aacgcgggga 


ggcagacaag 


gtatagggcg 


gcgcctacaa 


tccatgccaa 


4920 


cccgttccat 


gtgctcgccg 


aggcggcata 


aatcgccgtg 


acgatcagcg. gtccagtgat 


4980 


cgaagttagg 


ctggtaagag 


ccgcgagcga 


tccttgaagc 


tgtccctgat 


ggtcgtcatc 


5040 


tacctgcctg 


gacagcatgg 


cctgcaacgc 


gggcatcccg 


atgccgccgg 


aagcgagaag 


.5100 


aatcataatg 


gggaaggcca 


tccagcctcg 


cgtcgcgaac 


gccagcaaga 


cgtagcccag 


5160 


cgcgtcggcc 


gccatgccgg 


cgataatggc 


ctgcttctcg 


ccgaaacgtt 


tggtggcggg 


5220 


accagtgacg 


aaggcttgag 


cgagggcgtg 


caagattccg 


aataccgcaa 


gcgacaggcc 


5280 


gatcatcgtc 


gcgc tccagc 


gaaagcggtc 


c t cgc cgaa a 


atgacccaga 


gcgctgccgg 


5340 


cacctgtcct 


acgagttgca 


tgatiaaagaa 


gacagtcata 


agtgcggcga 


cgatagtcat 


5400 


gccccgcgcc 


caccggaagg 


agctgactgg 


gttgaaggct 


ctcaagggca 


tcggtcgacg 


5460 


ctctccctta 


tgcgactcct 


gcattaggaa 


gcagcccagt 


aatacrcrt tcra 




5520 


caccgccgcc 


gcaaggaatg 


gtgcatgctc 


gatggctacg 


agggcagaca 


gtaagtggat 


5580 


ttaccataat 


cccttaattg 


tacgcaccgc 


Caaaacgcgt: 


tcagcgcgat 


cacggcagca 


5640 


gacaggtaaa 


aatggcaaca 


aaccacccta 


aaaactgcgc 


gatcgcgcct 


gataaatittit 


5700 


aaccgtatga 


atacctatgc aaccagaggg 


tacaggccac 


attaccccca 


cttaatccac 


5760 


tgaagctgcc 


atttttcatg 


gtttcaccat 


cccagcgaag 


ggccatgcat 


gcatcgaaat 


5820 


taatacgacg 


aaattaatac 


gactcactat 


agggcaattg 


cgatcaccac 


aattcagcaa 


5880 


attgtgaaca 


tcatcacgtt 


catctttccc 


tggttgccaa 


tggcccattt 
gtaatgaac 


tcctgtcagt 


5940 


aacgagaagg 


tcgcgaattc 


aggcgctttt 


tagactggtc 




5989 



<210> 16 
<211> 6958 

<212> DNA - • 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence : Plasmid 
PJAVIER16 

<400> 16 

aattcttaag aaggagatat acatatggcg aaaaacttga tgctcgcggt cgctcaagtc 60 
ggcggtatcg atagttcgga atcaagaccc gaagtcgtcg cccgcttgat tgccctgctg 120 
gaagaagcag cttcccaggg cgcggaactg gtggtctttc ccgaactcac gctgaccacg 180 
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ttcttcccgc gtacctggtt cgaagaaggc 

cccaatgacg acgtcgcgcc ccttttcgaa 

5 ctcggatacg cggaactgac cagtgatgag 

aagcacggcg acatcgtcgg caagtaccgc 

cgggaaggac tacccaacca gcaccttgaa 

10 

ttcggtgtct tcgacttcca cggcgtgcag 

tggccggagg tctaccgctc tttggccctg 

15 aacacccccg atttcgttcc cggctggcag 

cttctttcac ttcaggcagg ggcataccag 

tcgggcttcg aagacgggca ccacatgatc 

20 

gaaatcctgg caaaagcagc cggcgagggc 

gacatgggca agccctataa ggaaagcgtc 

25 gcatacggca tcatcgccga aaggaaaggg 

gtgaatgact aaggatccga aggagatata 

actattgttt cctcgaccgg caaaatccga 

30 

gccgctgtcg gcatgctgga cgccgcgacg 

ggcaaatacg tcatgcccgg cggtatcgac 

35 gggaccacca ccgccgatga ttttgtcagc 

acgaccatcg tcgatttcgg acagcagctc 

gcgcaccaca aaaaggcgca ggggaaatcc 

40 

acgaacctct atgacaattt cgattcccat 

agtttcaagg tcttcatggc ctaccgcgga 

45 gacatcctca agggagtcgg ctccagcggt 

gacgtcatcg acaggatcgc cgccgacctc 

cacgagatcg cacgcccgcc ggaatcggaa 

50 

tcccggatgg ccgaggtgcc gctgtatttc 

gaagtagctg ccgcgcagat gacaggatgg 

55 ctgtcgctga gccgggacat ctacgaccag 

acaccaccgc tgcgcacaca ggaacaccag 

gcgctcagcg tcgtcagttc cgaccactgc 
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gacttcgagg aatacttcga taaatccatg 240 
cgcgccaaag accttggcgt gggcttctac 300 
aagcggtaca acacatcaat tctggtgaac 360 
aagatgcatc tgccgggcca cgccgataac 420 
aagaaatact tccgcgaagg agatctcgga 480. 
Sftcggaatgt gtctctgcaa cgaccggcga 540 
cagggagcag agctcgtcgt cctgggctac 600 
gaagagcctc acgcgaagat gttcacgcac 660 
aactcggtat ttgtggcggc tgccggcaag 720 
ggcggatcag cggtcgccgc gcccagcggc 780 
gatgaagtcg tcgttgtgaa agcagacatc 840 
ttcgacttcg ccgcccatcg gcgccccgac 900 
cggggcgccc cactgcccgt cccgttcaac 960 
catatggatg caaagctact ggttggcggc 1020 
gccgacgtgc tgattgaaaa. cggcaaagtc 1080 
ccggacacag ttgagcgggt tgactgcgac 1140 
gttcacaccc acatcgactc ccccctcatg 1200 
ggaacgattg cagccgctac cggcggaaca 1260 
gccggcaaga acctgctgga atccgcagac 1320 
gtcattgatt acggcttcca tatgtgcgtg 1380 
atggcagaac tgacacagga cggaatctcc 1440 
agcctgatga tcaacgacgg cgaactgttc 1500 
gccaaactat gcgtccacgc agagaacggc 1560 
tacgcccaag gaaaaaccgg gcccgggacc 1620 
gtcgaagcag tcagccgggc catcaagatc 1680 
gtgcatcttt ccacccaggg ggccgtcgag 1740 
ccaatcagcg ccgaaacgtg cacccactac 1800 
ccgggattcg agccggccaa agctgtcctc 1860 
gacgcgttgt ggagaggcat taacaccggt 1920 
cccttctgct ttgaggaaaa gcagcggatg 1980 
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ggggcagatg acttccggca gatccccaac ggcgggcccg gcgtggagca ccgaatgctc 2040 
gtgatgtatg agaccggtgt cgcggaagga aaaatgacga tcgagaaatt cgtcgaggtg 2100 
actgccgaga acccggccaa gcaattcgat atgtacccga aaaagggaac aattgcaccg 2160 
ggctccgatg cagacatcat cgtggtcgac cccaacggaa caaccctcat cagtgccgac 2220 
acccaaaaac aaaacatgga ctacacgctg ttcgaaggct tcaaaatccg ttgctccatc 22 80 
gaccaggtgt tctcgcgtgg cgacctgatc agcgtcaaag gcgaatatgt cggcacccgc 2340 
ggccgcggcg aattcatcaa gcggagcgct tggagccacc cgcagttcga aaaataaaag 2400 
cttggctgtt ttggcggatg agagaagatt ttcagcctga tacagattaa atcagaacgc 2460 
agaagcggtc tgataaaaca gaatttgcct ggcggcagta gcgcggtggt cccacctgac 2520 
cccatgccga actcagaagt gaaacgccgt agcgccgatg gtagtgtggg gtctccccat 2 580 
gcgagagtag ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga aagactgggc 2640 
ctttcgtttt atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa atccgccggg 2700 
agcggatttg aacgttgcga agcaacggcc cggagggtgg cgggcaggac gcccgccata 2760 
aactgccagg catcaaatta agcagaaggc catcctgacg gatggccttt ttgcgtttct 2820 
acaaactctt ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 2880 
aaccctgata aatgcttcaa taatattgaa aaaggaagag . tatgagtatt . caacatttcc 2940 
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa 30.00 
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac 3060 
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga 3120 
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtgttgac gccgggcaag 3180 
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca 3240 
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca 3300 
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa 3360 
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc 3420 
tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa 3480 
cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag 3540 
actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct 3600 
ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac 3660 
tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa 3720 
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ctatggatga acgaaataga cagatcgctg 
aactgtcaga ccaagtttac tcatatatac 
5 ttaaaaggat ctaggtgaag atcctttttg 
agttttcgtt ccactgagcg tcagaccccg 
ctttttttct gcgcgtaatc tgctgcttsrc. 

10 

tttgtttgcc ggatcaagag ctaccaactc 
cgcagatacc aaatactgtc cttctagtgt 
15 ctgtagcacc. gcctacatac ctcgctctgc 
gcgataagtc gtgtcttacc gggttggact 
ggtcgggctg aacggggggt tcgtgcacac 

20 

aactgagata cctacagcgt gagctatgag 
cggacaggta tccggtaagc ggcagggtcg 
25 ggggaaacgc ctggtatctt tatagtcctg 
gatttttgtg atgctcgtca ggggggcgga 
ttttacggtt cctggccttt tgctggcctt 

30. 

ctgattctgt ggataaccgt attaccgcct 
gaacgaccga gcgcagcgag tcagtgagcg 
35 ttctccttac gcatctgtgc ggtatttcac 
ctgctctgat gccgcatagt taagccagta 
atggctgcgc cccgacaccc gccaacaccc 

40 

ccggcatccg cttacagaca agctgtgacc 
tcaccgtcat caccgaaacg cgcgaggcag 
45 agcgattcac agatgtctgc ctgttcatcc 
gttaatgtct ggcttctgat aaagcgggcc 
acttgatgcc tccgtgtaag ggggaatttc 

50 

gagagaggat gctcacgata cgggttactg 
gtgagggtaa acaactggcg gtatggatgc 
55 aatgccagcg cttcgttaat acagatgtag 
cgatgcagat ccggaacata atggtgcagg 
aaacacggaa accgaagacc attcatgttg 
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agataggtgc 


ctcactgatt 


aagcattggt 


3780 


tttagattga 


tttaaaactt 


catttttaat 


3840 


ataatctcat 


gaccaaaatc 


ccttaacgtg 


3900 


tagaaaagat 


caaaggatct 


tcttgagatc 


3960 


aaacaaaaaa 


accaccgcta 


ccagcggtgg 


4020 


tttttccgaa 


ggtaactggc 


ttcagcagag 


4080 


agccgtagtt 


aggccaccac 


ttcaagaact 


4140 


taatcctgtt 


accagtggct 


gctgccagtg 


4200 


caagacgata 


gttaccggat 


aaggcgcagc 


4260 


agcccagctt 


ggagcgaacg 


acctacaccg 


4320 


aaagcgccac 


gcttcccgaa 


gggagaaagg 


4380 


gaacaggaga 


gcgcacgagg 


gagcttccag 


4440 


tcgggtttcg 


ccacctctga 


cttgagcgtc 


4500 


gcctatggaa 


aaacgccagc 


aacgcggcct 


4560 


ttgctcacat 


gttctttcct 


gcgttatccc 


4620 


ttgagtgagc 


tgataccgct 


cgccgcagcc 


4680 


aggaagcgga 


agagcgcctg 


atgcggtatt 


4740 


accgcatata 


tggtgcactc 


tcagtacaat 


4800 


tacactccgc 


tatcgctacg 


tgactgggtc 


4860 


gctgacgcgc 


cctgacgggc 


ttgtctgctc 


4920 


gtctccggga 


gctgcatgtg 


tcagaggttt 


4980 


ctgcggtaaa 


gctcatcagc 


gtggtcgtga 




gcgtccagct 


cgttgagttt 


c t c c agaagc 


5100 


atgttaaggg 


cggttttttc 


ctgtttggtc 


,5160 


tgttcatggg 


ggtaatgata 


ccgatgaaac 


5220 


atgatgaaca 


tgcccggtta 


ctggaacgtt 


5280 


ggcgggacca 


gagaaaaatic 


actcagggtc 


5340 


gtgttccaca 


gggtagccag 


cagcatcctg 


5400 


gcgctgactt 


ccgcgtttcc 


agactttacg 


5460 


ttgctcaggt 


cgcagacgtt 


ttgcagcagc 


5520 
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agtcgcttca 


cgttcgctcg 


cgtatcggtg 


attcattctg 


ctaaccagta 


aggcaacccc 


5580 


gccagcctag 


ccgggtcctc 


aacgacagga 


gcacgatcat 


gcgcacccgt 


ggccaggacc 


5640 


caacgctgcc 


cgagatgcgc 


cgcgtgcggc 


tgctggagat. ggcggacgcg atggatatgt 


5700 


tctgccaagg 


gttggtttgc 


gcattcacag 


ttctccgcaa 


gaattgattg 


gctccaattc 


5760 


ttggagtggt 


gaatccgtta 


gcgaggtgcc 


gccggcttcc 


attcaggtcg 


aggtggcccg 


5820 


gctccatgca 


ccgcgacgca 


acgcggggag 


gcagacaagg 


tatagggcgg 


cgcctacaat 


5880 


ccatgccaac 


ccgttccatg 


tgctcgccga ggcggcataa 


atcgccgtga 


cgatcagcgg 


5940 


tccagtgatc 


gaagttaggc 


tggtaagagc 


cgcgagcgat 


ccttgaagct 


gtccctgatg 


6000 


gtcgtcatct 


acctgcctgg 


acagcatggc 


ctgcaacgcg 


ggcatcccga 


tgccgccgga 


6060 


agcgagaaga 


atcataatgg 


ggaaggccat 


ccagcctcgc 


gtcgcgaacg 


ccagcaagac 


6120 


gtagcccagc 


gcgtcggccg 


ccatgccggc 


gataatggcc 


tgcttctcgc 


cgaaacgttt 


6180 


ggtggcggga 


ccagtgacga 


aggcttgagc 


gagggcgtgc 


aagattccga 


ataccgcaag 


6240 


cgacaggccg 


atcatcgtcg 


cgctccagcg 


aaagcggtcc 


tcgccgaaaa 


tgacccagag 


6300 


cgctgccggc 


acctgtccta 


cgagttgcat 


gataaagaag 


acagtcataa 


gtgcggcgac 


6360 


gatagtcatg 


ccccgcgccc 


accggaagga 


gctgactggg 


ttgaaggctc 


tcaagggcat 


6420 


cggtcgacgc 


tctcccttat 


gcgactcctg 


cattaggaag 


cagcccagta 


gtaggttgag 


6480 


gccgttgagc 


accgccgccg 


caaggaatgg 


tgcatgctcg 


atggctacga 


gggcagacag 


6540 


taagtggatt 


taccataatc 


ccttaattgt 


acgcaccgct 


aaaacgcgtt 


cagcgcgatc 


6600 


acggcagcag 


acaggtaaaa 


atggcaacaa 


accaccctaa 


aaactgcgcg 


atcgcgcctg 


6660 


ataaatttta 


accgtatgaa 


tacctatgca 


accagagggt 


acaggccaca 


ttacccccac 


6720 


ttaatccact 


gaagctgcca 


tttttcatgg 


tttcaccatc 


ccagcgaagg 


gccatgcatg 


6780 


catcgaaatt 


aatacgacga 


aattaatacg actcactata 


gggcaattgc 


gatcaccaca 


6840 


attcagcaaa 


ttgtgaacat 


catcacgttc 


atctttccct 


ggttgccaat 


ggcccatttt 


6900 


cctgtcagta 


acgagaaggt 


cgcgaattca 


ggcgcttttt 


agactggtcg 


taatgaac 


6958 
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